Learn Spreadsheet to Mine Data and Jumpstart Your Data Journalism Career – A Sharing by Aimee Edmondson

29 Wednesday Nov 2017

Posted by Pili Hu in Opinion, Resources, Tool, Tutorial

Tags

data, Data Clean, Excel, GICJ17, spreadsheet

Aimee Edmondson is now an Associate Professor with Scripps School of Journalism, Ohio University. HKBU students are very lucky to have this knowledgeable and passionate speaker to talk about data journalism this afternoon. Her 12 years in reporting and later acquired statistics and technology are a fine combination for a data journalist. In the world where people are too fascinated by new technology and numerous boot camps are created by non-journalists, Aimee can be a role model for those “traditional journalists” who are moving in this direction.

Why does data matter? In Aimee’s words, you want to be a reporter, not a repeater. Data helps one to verify what the source is saying and find out what is really happening. To be pragmatic, we are seeing more and more JD requiring data analytics skills from investigative reporters. Going beyond the journalism domain, the skills trained by data journalism can well fit into corporate communication, public relation and advertising industry.

Picture: Job boards on IRE, from the slides

To start, one only needs to work on “small data”, with a spreadsheet.

Continue reading →

Public Sharing: What can non-coders achieve in 3 months?

28 Tuesday Nov 2017

Posted by Bobo Wei in Announcements, Gallery

≈ Leave a comment

Tags

data, Data Vis, Open Lecture

Title: What can non-coders achieve in 3 months? – A sharing of three journalism students from JOUR2106 Course.

Time: Nov 29 (Wed) ,3:30 p.m.-5:00 p.m.

Venue: CVA 1024

Introduction:

Some students are curious about the outcome of JOUR2106 Data Visualization for News. Since there is a lot of interest, we will hold a DNN event and invite three students who took JOUR2106 last year to introduce their projects and share a personal experience. Registration: No need for sign-up in prior.

Continue reading →

網絡數據包分析：從 Google Maps 獲取 Fusion Table 原始數據

26 Sunday Nov 2017

Posted by Pili Hu in Tool

≈ Leave a comment

Tags

data collection, Network Analysis, Scraping

網上經常見到使用 Google Maps 繪製的地圖，如果希望對地圖中的興趣點（Point of Interest，POI）進行二次分析，就需要得到繪製地圖背後的結構化數據。如果是使用 Google Fusion Table 繪製的地圖，可以通過網絡抓包找到 Fusion Table 的ID，進而拼接出原始地址。本文來自同學 Lam Man Kit 的投稿，僅做技術交流。數據記者在使用時，需要注意原始數據的版權。而本地的研究者也需要遵守公平使用原則。本文以 FactWire 的數據報道「分析182個領展停車場月租收費 9成2貴過房委會同區最大差距達1.18倍」爲例。

圖：通過網絡抓包分析 Fusion Table 的 ID

Continue reading →

Data News of the Week | e-waste in Hong Kong

25 Saturday Nov 2017

Posted by Pili Hu in Event, Resources

≈ Leave a comment

Tags

Civic Tech, DNW, e-waste, environment, GICJ17

Cover photo credit: Monitour Project

We have a special edition for DNW this week dedicated to e-waste in Hong Kong. The notes are derived from a seminar plus brainstorm session with researchers from CUHK, HKBU, PolyU, Lingnan U, activists from Land Justice, Open Data Hong Kong, CODE4HK. This is a quick note from memory, so evidence/ statistics/ figures quoted in this note need further verification before you use them. There are enough pointers for the reader to go back the source and find direct contacts.

The news points to follow

E-waste refers to the abandoned Electric and Electronic Equipments (EEE). With the booming of ICT industry, we are witnessing more and more e-waste these days. Why should you care? Let’s cut through the news points first:

75% e-waste is disappeared, as Green Peace estimates. It collects data of EEE production and calculates expected e-waste according to the lifespans of devices. Comparing this with the e-waste collection data from formal government bodies, we can see a 75% gap, meaning those are lost track
97.7% e-waste in Hong Kong goes to unknown channels (figure in 2009; may change due to new recycling plant; government is trying to increase supervised channels). This may signal a large number of illegal operation, but not necessary all illegal.
Hong Kong used to import a large volume of e-wastes given the loophole in the legislations. Those e-wastes went to mainland China for processing. The export to China was disrupted at 2015.
Yards/ factories/ workshops that collect, process and dump e-wastes exist in many remote locations in Hong Kong, especially New Territory. Those locations are not easily accessible, protected by “private lands” and “gangs”, as put by Land Justice investigators.
Many workers in those yards are illegal immigrants, for example from mainland and South East Asia. They usually work without proper protective measures.

Continue reading →

Data News of the Week | Paradise Papers

22 Wednesday Nov 2017

Posted by Bobo Wei in general, news story, Resources

≈ 2 Comments

Tags

data, Data Vis, DNW, news

Do you still remember the massive Panama Paper leak in 2016? When 13.4 million financial documents were released in this November, the offshore paradise islands got global attention again. Paradise Papers cover the time period from 1950 to 2016, including the more than 120,000 people and 25,000 offshore companies.

Tech-savvy readers can jump to the database directly. Like before, the dataset is modelled as a graph, namely treating the Officers, Intermediaries and Addresses as nodes and their relationships as links. Neo4j is one widely adopted graph database. Its web user interface, called “neo4j browser”, allows journalists to visually expand and explore a graph. The query language “Cypher” is a superset of relational query (SQL), full-text search and graph pattern matching. Its flexibility and built-in graph algorithms allow experienced journalists to systematically study the underlying graph. The download page on ICIJ includes snapshots of four neo4j databases exported in CSV format.

Continue reading →

Data News of the Week | North Korea Tensions

12 Sunday Nov 2017

Posted by Bobo Wei in news story, Resources

≈ 1 Comment

Tags

data, Data Vis, DNW

“North Korea” or “Democratic People’s Republic of Korea (DPRK)” are recurrent and frequent headlines in the newspapers. The recent advances in missile technology and nuclear tests threatens the world and creates a lot of geopolitical tensions. Our editor would like to share relevant data projects this week.

The “wholesale” packages

Assuming you are too busy to study all the background information and catch up the latest news, here are two must-read projects that get you up to date in 30 minutes.

☞ Immersive reporting from ESRI StoryMaps: side by side comparison of two Koreas in multiple angles [Link]

Continue reading →

Embedding interactive rich media on WordPress

07 Tuesday Nov 2017

Posted by jadeimaginist in Tool, Tutorial

≈ 2 Comments

Tags

canva, Google Chart, infogram, Kumu, StoryMapJS, Tableau, TimelineJS

Source: Wiki Commons

There are a lot “one-click” tools available online that help you to create good visualisation and export to iframe for embedding into your site. Good use of those tools can better present your content to the readers. Note that the free version of WordPress hosted service does not allow embedding iframe, so they can only rely on shortcodes. For example, one can use is to embed interactive charts generated from Google Sheets. See more options of available shortcodes for free version here

Data and News Society is operated on a paid plan so we installed the iframe plugin. This makes it possible to enable a wide range of 3rd party visualisation into your project. This tutorial is contributed by Jade Li to demo how to embed interactive content from several common tools. The general workflow is to first export the 3rd party project as iframe, find the URL in the src=”” section, and use [ iframe src=”” ] to embed it into WordPress.

Continue reading →

5 Signs to Indicate the Future of Hong Kong Media Is Bright

07 Tuesday Nov 2017

Posted by jessiepyt in Opinion

≈ 1 Comment

It’s almost the end of 2017 and perhaps it’s the right time to predict the future of Hong Kong media. Although many might think the industry enters another “dark age”, it is not as worse as the majority think. We observed some promising trends of Hong Kong media landscape this year:

Continue reading →

Recap of Oct 2017 Data Journalism Bootcamp in HKBU

07 Tuesday Nov 2017

Posted by Bobo Wei in Event, Resources, Tool

≈ Leave a comment

Tags

Bootcamp, data, Data Journalism, Data Vis

The 2-day Data Journalism Boot Camp was successfully held in HKBU on Oct 26 and Oct 27. The event was sponsored by KAS and the workshop sessions were led by two experienced trainers from DataLEADS. Another highlight of the event was a roundtable discussion chaired by Prof. Ying Chen, where professionals shared their practices, challenges and solutions in the newsrooms.

Data Bootcamp in Oct 2017

Continue reading →

wget最簡爬蟲：一行命令助攻調查記者

06 Monday Nov 2017

Posted by Bobo Wei in Resources, Tool

≈ Leave a comment

Tags

crawler, 爬蟲, data collection, scraper, wget

書寫爬蟲已經成爲數據記者的必備技能。雖然有諸如ScrapingHub、Morph、ParseHub等在線服務，可以一定程度上實現無代碼抓取網頁，但很多時候，還是需要手動編寫爬蟲邏輯。爬蟲書寫分爲兩個部分，第一個是爬，第二個是取。「爬」即是從一個網頁出發，找到它所包含的鏈接，逐一訪問，不斷重複這個過程，最終收穫到需要的頁面。這個過程和人們瀏覽網頁是類似的，有種「順藤摸瓜」的意思。「取」則是從網頁中提取有效信息的過程，將「半結構化」的網頁，轉換爲「結構化」的數據表格。

本文介紹最簡單的爬蟲，只需要一行命令： wget -r

Continue reading →

	A quick video I made… on New towns fail to be self-cont…
	Erin Chan on Create Simple Filled Map (HK)…
	National Congress: s… on “Big Data” Tells Y…
	Pili Hu on Data News of the Week \| Gender…
	Pili Hu on Key Takes from Jessica Lo…

The Data & News Society

~ news/numbers; stats/stories

Monthly Archives: November 2017

Learn Spreadsheet to Mine Data and Jumpstart Your Data Journalism Career – A Sharing by Aimee Edmondson

Public Sharing: What can non-coders achieve in 3 months?

Title: What can non-coders achieve in 3 months? – A sharing of three journalism students from JOUR2106 Course.

Time: Nov 29 (Wed) ,3:30 p.m.-5:00 p.m.

Venue: CVA 1024

Introduction:

網絡數據包分析：從 Google Maps 獲取 Fusion Table 原始數據

Data News of the Week | e-waste in Hong Kong

The news points to follow

Data News of the Week | Paradise Papers

Data News of the Week | North Korea Tensions

The “wholesale” packages

☞ Immersive reporting from ESRI StoryMaps: side by side comparison of two Koreas in multiple angles [Link]

Embedding interactive rich media on WordPress

5 Signs to Indicate the Future of Hong Kong Media Is Bright

Recap of Oct 2017 Data Journalism Bootcamp in HKBU

wget最簡爬蟲：一行命令助攻調查記者