Author: Bobo Wei

I am a Data News Project Assistant at Hong Kong Baptist University.Graduated from Business and Financial Journalism(M.A.).
As a Journalist, I like to record what happened with words and photos. I always believe that if everyone be kind to one and other that will make the world better. Moreover I believe that every person has their own story which most people don’t really understand. So if you want to share your life I am willing to listen your own story.

活用多個數據庫做企業背景調查:一篇民間調查的方法解析

原文:我猜你们一定很想了解一下红黄蓝

隨著北京紅黃藍幼兒園的虐童事件的不斷發酵,微博熱搜一度出現:「三種顏色不能上熱搜」熱門話題。中國社交媒體的兩大平台「微博」和「微信公眾號」的網絡大V們仍舊不斷在針對此次事件从不同角度發表各種文章。這次小編帶大家看看一個經常以吐槽科技公司及其產品的科技自媒體「差評君」,在第一時間發表了的一篇針對「紅黃藍幼兒園經營背景」的調查,獲得超過十萬加阅读量的微信公眾號文章。

這篇文章通過網絡公開資料,以熟練運用各種搜索工具作為主要手段,呈現了一個數據調查報道的成功案例。小編進行「逆向工程」,帶大家分析一下這篇文章中使用到的數據庫和調查手段。

Continue reading “活用多個數據庫做企業背景調查:一篇民間調查的方法解析”

Public Sharing: What can non-coders achieve in 3 months?

Title: What can non-coders achieve in 3 months? – A sharing of three journalism students from JOUR2106 Course.

Time: Nov 29 (Wed) ,3:30 p.m.-5:00 p.m.

Venue: CVA 1024

Introduction:

Some students are curious about the outcome of JOUR2106 Data Visualization for News. Since there is a lot of interest, we will hold a DNN event and invite three students who took JOUR2106 last year to introduce their projects and share a personal experience. Registration: No need for sign-up in prior.

Continue reading “Public Sharing: What can non-coders achieve in 3 months?”

Data News of the Week | Paradise Papers

Do you still remember the massive Panama Paper leak in 2016? When 13.4 million financial documents were released in this November, the offshore paradise islands got global attention again. Paradise Papers cover the time period from 1950 to 2016, including the more than 120,000 people and 25,000 offshore companies.

Tech-savvy readers can jump to the database directly. Like before, the dataset is modelled as a graph, namely treating the Officers, Intermediaries and Addresses as nodes and their relationships as links. Neo4j is one widely adopted graph database. Its web user interface, called “neo4j browser”, allows journalists to visually expand and explore a graph. The query language “Cypher” is a superset of relational query (SQL), full-text search and graph pattern matching. Its flexibility and built-in graph algorithms allow experienced journalists to systematically study the underlying graph. The download page on ICIJ includes snapshots of four neo4j databases exported in CSV format.

Continue reading “Data News of the Week | Paradise Papers”

Data News of the Week | North Korea Tensions

“North Korea” or “Democratic People’s Republic of Korea (DPRK)” are recurrent and frequent headlines in the newspapers. The recent advances in missile technology and nuclear tests threatens the world and creates a lot of geopolitical tensions. Our editor would like to share relevant data projects this week.

The “wholesale” packages

Assuming you are too busy to study all the background information and catch up the latest news, here are two must-read projects that get you up to date in 30 minutes.

☞ Immersive reporting from ESRI StoryMaps: side by side comparison of two Koreas in multiple angles [Link]

image2 Continue reading “Data News of the Week | North Korea Tensions”

Recap of Oct 2017 Data Journalism Bootcamp in HKBU

The 2-day Data Journalism Boot Camp was successfully held in HKBU on Oct 26 and Oct 27. The event was sponsored by KAS and the workshop sessions were led by two experienced trainers from DataLEADS. Another highlight of the event was a roundtable discussion chaired by Prof. Ying Chen, where professionals shared their practices, challenges and solutions in the newsrooms.

Data Bootcamp in Oct 2017

Continue reading “Recap of Oct 2017 Data Journalism Bootcamp in HKBU”

wget最簡爬蟲:一行命令助攻調查記者

書寫爬蟲已經成爲數據記者的必備技能。雖然有諸如ScrapingHub、Morph、ParseHub等在線服務,可以一定程度上實現無代碼抓取網頁,但很多時候,還是需要手動編寫爬蟲邏輯。爬蟲書寫分爲兩個部分,第一個是爬,第二個是取。「爬」即是從一個網頁出發,找到它所包含的鏈接,逐一訪問,不斷重複這個過程,最終收穫到需要的頁面。這個過程和人們瀏覽網頁是類似的,有種「順藤摸瓜」的意思。「取」則是從網頁中提取有效信息的過程,將「半結構化」的網頁,轉換爲「結構化」的數據表格。

本文介紹最簡單的爬蟲,只需要一行命令: wget -r

Continue reading “wget最簡爬蟲:一行命令助攻調查記者”

Data Journalism Open Lecture on November 11.

Shan HE, Project director of Greenovation Hub, Public Lab Organizer, member of Guangzhou International Dragon Boat team. Living in Changzhou island in Guangzhou, is coming to Hong Kong Baptist University on 11 November to talk about Environment Issue, Open Technology and Data Visualisation.

When: 11:30 AM-12:10 PM, Nov. 11, 2017 (Saturday).

Where: Room 703, 7/F, Communication and Visual Arts Building, Hong Kong Baptist University.

Continue reading “Data Journalism Open Lecture on November 11.”

Data News of the Week | Power in China

The closing session of 19th National Congress of the Communist Party of China finished this week. New Politburo Standing Committee presented to the Media, putting Beijing in the centre of world attention. This DNW hand-picks recent data news related to Power in China.

25 year’s political path to Power in China [Link]

Bloomberg Politics made an unconventional data visualisation to show The Path to Power in China. Readers can easily tell running a Big Region is important in China, by reading the following line chart. The chart successfully turned categorical position data into ordinal data by sorting the importance, namely number of people who entered Standing Committee from that position.

1

Continue reading “Data News of the Week | Power in China”

Data News of the Week | Nobel Prize and Hong Kong Chief Executive Policy Address

Noble Prize and Policy Address 2017.

Nobel Prize. Michael Greshko from National Geographic finds out that Nearly 900 People Have Won Nobel Prizes. Only 48 Were Women. The gender gap exists, regardless that female winners are increasing in recent years.

 

Nobel Prize

Continue reading “Data News of the Week | Nobel Prize and Hong Kong Chief Executive Policy Address”

Data Journalism Boot Camp in Hong Kong

Dear all,

We are honoured to introduce you a one and a half day Data Journalism Boot Camp in Hong Kong.

Location (Map): Room 506, 5/F, Communication and Visual Arts Building (CVA), Hong Kong Baptist University. 

Time: Oct. 26 (Thursday) 9am-5pm & Oct. 27 (Friday) 9am-3pm.

Interested applicants should fill in the online form in this (deadline for applications to be 13 Oct.) link: https://www.surveygizmo.com/s3/3864735/Data-Journalism-Hong-Kong

The workshop will be run by a pair experienced trainers from DataLEADS including the founder and Editor-in-Chief of the Indian Centre for Investigative Journalism.

1507518636(1).png

Continue reading “Data Journalism Boot Camp in Hong Kong”