• Home
  • Event
  • Article
    • Gallery
    • Opinion
    • Resources
  • About
    • Founding Members
    • Steering Committee
    • Executive Officers
  • Newsletter

The Data & News Society

~ news/numbers; stats/stories

The Data & News Society

Author Archives: Pili Hu

Tech in IJ: An observation and rethink after GIJC17

17 Sunday Dec 2017

Posted by Pili Hu in Opinion

≈ Leave a comment

Tags

GIJC, GIJC17, Investigative Journalism, Technology

The sharing Pili Hu gave about the experience, observations and thoughts taken away from the trip to Global Investigative Journalism Conference 2017. Slides are followed. Keep reading for an outline of topics discussed in this talk.

[iframe src=”https://docs.google.com/presentation/d/e/2PACX-1vQ0-NXMaMBIHqMbu18i8h2CzURdx7VnUcRUaTtsfSQyy8svbXOAOcx5XMWaNYHUItlQIlTlBwZC7a0f/embed?start=false&loop=false&delayms=3000″%5D

Continue reading →

Curate datasets for fun, and profit

14 Thursday Dec 2017

Posted by Pili Hu in Opinion, Resources

≈ 1 Comment

Tags

Business Model, data source, database, datasets

Cover photo source: raul kalvo

The term “Curator” was traditionally used in the context of museums, library, gallery and art exhibitions. It general refers to the person who creatively plan and well organise resources to maximise the utility for the audience. The process that a curator gets the job done is thus “curate”.

TLDR readers, here are the sites that worth your visit:

  • https://public.enigma.com
  • https://www.statista.com/
  • https://datausa.io/  ; https://dataafrica.io/ ;   http://dataviva.info/en/
  • https://ourworldindata.org
  • https://www.theatlas.com/
  • http://old.datahub.io/
  • https://github.com/caesar0301/awesome-public-datasets

Continue reading →

賣數據?賣能力?淺談數據新聞的商業出路 — 香港網絡媒體峯會後感

06 Wednesday Dec 2017

Posted by Pili Hu in Opinion

≈ Leave a comment

Tags

Business Model, GIJC, GIJC17

【編註】本文是2017年12月4日的香港網絡媒體峯會後的感想,由 Pili Hu 口述完成,先使用科大訊飛轉爲文字稿,再通過 OpenCC 由簡體轉爲繁體,經過輕量編輯後發出。因爲是口述稿,行文難免累贅,有部分用語不準確的地方,歡迎留言指正。從技術上看,是我們一次新的嘗試,希望藉此方法加速信息傳遞的效率。總共耗時爲1小時在地鐵上口述,加1.5小時編輯。

今天下午的網絡傳媒峯會。幾位主編都有分享,很多媒體運營的經驗。大家談論最多的話題還是媒體的商業模式。用端傳媒主編張潔平的話來說,媒體的商業模式主要有三種。第一種是廣告模式,即先用內容換取流量,然後再用流量去向廣告主收費。第二種模式是直接付費,即把內容直接賣給讀者,而最好的賣法是訂閱,相比點播,訂閱可以保證持續的現金流。第三種辦法,則是贊助,一些基金會或者個人金主,都會贊助大大小小的媒體。

香港網絡媒體高峯會2017@HKICC

Continue reading →

Learn Spreadsheet to Mine Data and Jumpstart Your Data Journalism Career – A Sharing by Aimee Edmondson

29 Wednesday Nov 2017

Posted by Pili Hu in Opinion, Resources, Tool, Tutorial

≈ 1 Comment

Tags

data, Data Clean, Excel, GICJ17, spreadsheet

Aimee Edmondson is now an Associate Professor with Scripps School of Journalism, Ohio University. HKBU students are very lucky to have this knowledgeable and passionate speaker to talk about data journalism this afternoon. Her 12 years in reporting and later acquired statistics and technology are a fine combination for a data journalist. In the world where people are too fascinated by new technology and numerous boot camps are created by non-journalists, Aimee can be a role model for those “traditional journalists” who are moving in this direction.

Why does data matter? In Aimee’s words, you want to be a reporter, not a repeater. Data helps one to verify what the source is saying and find out what is really happening. To be pragmatic, we are seeing more and more JD requiring data analytics skills from investigative reporters. Going beyond the journalism domain, the skills trained by data journalism can well fit into corporate communication, public relation and advertising industry.

Picture: Job boards on IRE, from the slides

To start, one only needs to work on “small data”, with a spreadsheet.

Continue reading →

網絡數據包分析:從 Google Maps 獲取 Fusion Table 原始數據

26 Sunday Nov 2017

Posted by Pili Hu in Tool

≈ Leave a comment

Tags

data collection, Network Analysis, Scraping

網上經常見到使用 Google Maps 繪製的地圖,如果希望對地圖中的興趣點(Point of Interest,POI)進行二次分析,就需要得到繪製地圖背後的結構化數據。如果是使用 Google Fusion Table 繪製的地圖,可以通過網絡抓包找到 Fusion Table 的ID,進而拼接出原始地址。本文來自同學 Lam Man Kit 的投稿,僅做技術交流。數據記者在使用時,需要注意原始數據的版權。而本地的研究者也需要遵守公平使用原則。本文以 FactWire 的數據報道 「分析182個領展停車場月租收費 9成2貴過房委會同區 最大差距達1.18倍」 爲例。

圖:通過網絡抓包分析 Fusion Table 的 ID

Continue reading →

Data News of the Week | e-waste in Hong Kong

25 Saturday Nov 2017

Posted by Pili Hu in Event, Resources

≈ Leave a comment

Tags

Civic Tech, DNW, e-waste, environment, GICJ17

Cover photo credit: Monitour Project

We have a special edition for DNW this week dedicated to e-waste in Hong Kong. The notes are derived from a seminar plus brainstorm session with researchers from CUHK, HKBU, PolyU, Lingnan U, activists from Land Justice, Open Data Hong Kong, CODE4HK. This is a quick note from memory, so evidence/ statistics/ figures quoted in this note need further verification before you use them. There are enough pointers for the reader to go back the source and find direct contacts.

The news points to follow

E-waste refers to the abandoned Electric and Electronic Equipments (EEE). With the booming of ICT industry, we are witnessing more and more e-waste these days. Why should you care? Let’s cut through the news points first:

  • 75% e-waste is disappeared, as Green Peace estimates. It collects data of EEE production and calculates expected e-waste according to the lifespans of devices. Comparing this with the e-waste collection data from formal government bodies, we can see a 75% gap, meaning those are lost track
  • 97.7% e-waste in Hong Kong goes to unknown channels (figure in 2009; may change due to new recycling plant; government is trying to increase supervised channels). This may signal a large number of illegal operation, but not necessary all illegal.
  • Hong Kong used to import a large volume of e-wastes given the loophole in the legislations. Those e-wastes went to mainland China for processing. The export to China was disrupted at 2015.
  • Yards/ factories/ workshops that collect, process and dump e-wastes exist in many remote locations in Hong Kong, especially New Territory. Those locations are not easily accessible, protected by “private lands” and “gangs”, as put by Land Justice investigators.
  • Many workers in those yards are illegal immigrants, for example from mainland and South East Asia. They usually work without proper protective measures.

Continue reading →

100 Year’s Earthquake around Sichuan in D3

02 Saturday Sep 2017

Posted by Pili Hu in Gallery

≈ Leave a comment

Tags

D3, data, earthquake

[iframe src=”https://hupili.net/20170800-sichuan-earthquake-in-100years/index.html”%5D

Jiuzhaigou, a world-class scenic spot in Sichuan, China, suffered a drastic earthquake on 2017-08-08 21:19:46. People still remember the Wenchuan earthquake that caused thousands of deaths and loss of enormous properties. Does Sichuan have a lot of earthquakes in history? Which counties suffered most from the earthquakes? Pili Hu made this animated chart in D3 overnight for an overview of earthquakes happened in or around Sichuan region in the past 100 years. The data source is https://www.usgs.gov/. See the project in full page here: Sichuan Earthquake in 100 years.

近況|「端」數據

23 Wednesday Dec 2015

Posted by Pili Hu in Opinion

≈ 1 Comment

(This is a repost from “P话” http://bit.ly/1OiVjaa)

P話停更許久,很多朋友來關心過。感覺是時候報報近況。

那現在我在做什麼?——我是記者。在「端傳媒」。

說時髦一點,是「數據記者」。唬人一點,也可以自稱「數據科學家」,畢竟我還是有些許學術和技術背景。既然是「數據記者」,那我們做的自然就叫「數據新聞」了。什麼算數據新聞呢?回答這個問題,如同回答什麼是雲計算?什麼是物聯網?什麼是大數據?什麼是H5?……和任何一個新興行業一樣,buzzword的背後,有人在做乾貨,有人只是在玩弄商業詞藻。有人是興趣使然,有人只是職業憂慮——怕跟不上這一波潮流。所以,不如拋開定義,舉點例子。其實,我幾個月前寫的文章《40张图解Code for系列风潮》中,提到兩個案例即是。 Continue reading →

Newer posts →

Top Posts & Pages

  • The Setup of D&N Society
  • Create Simple Filled Map (HK) in Tableau
  • Li's family business map and spring layout analysis
  • Learn Spreadsheet to Mine Data and Jumpstart Your Data Journalism Career - A Sharing by Aimee Edmondson
  • 立法會小百科 Q&A

Recent Posts

  • A dossier of data journalism teaching strategies: Words from journalism educators worldwide
  • “中国数据可视化大赛”创作者专访:数据“几人行”
  • “Whoah, wait a minute, every reporter needs to be a data reporter”: Conversations with two generations of data journalists at the Los Angeles Times
  • Aaron Mendelson: Would numbers work with radio?
  • 首届“中国数据可视化大赛”启动-数据中的宏观和微观世界
  • Job Opportunity: Market Information Specialist at Unum Networks

Recent Comments

Unknown's avatarA quick video I made… on New towns fail to be self-cont…
Erin Chan's avatarErin Chan on Create Simple Filled Map (HK)…
National Congress: s… on “Big Data” Tells Y…
Pili Hu's avatarPili Hu on Data News of the Week | Gender…
Pili Hu's avatarPili Hu on Key Takes from Jessica Lo…

Archives

  • August 2019
  • June 2019
  • April 2019
  • March 2019
  • February 2019
  • December 2018
  • November 2018
  • October 2018
  • September 2018
  • June 2018
  • May 2018
  • April 2018
  • March 2018
  • February 2018
  • January 2018
  • December 2017
  • November 2017
  • October 2017
  • September 2017
  • August 2017
  • May 2017
  • April 2017
  • March 2017
  • February 2017
  • January 2017
  • December 2016
  • November 2016
  • October 2016
  • September 2016
  • August 2016
  • July 2016
  • June 2016
  • May 2016
  • April 2016
  • March 2016
  • February 2016
  • January 2016
  • December 2015
  • November 2015

Meta

  • Create account
  • Log in
  • Entries feed
  • Comments feed
  • WordPress.com

Recent Posts

  • A dossier of data journalism teaching strategies: Words from journalism educators worldwide
  • “中国数据可视化大赛”创作者专访:数据“几人行”
  • “Whoah, wait a minute, every reporter needs to be a data reporter”: Conversations with two generations of data journalists at the Los Angeles Times
  • Aaron Mendelson: Would numbers work with radio?
  • 首届“中国数据可视化大赛”启动-数据中的宏观和微观世界

Recent Comments

Unknown's avatarA quick video I made… on New towns fail to be self-cont…
Erin Chan's avatarErin Chan on Create Simple Filled Map (HK)…
National Congress: s… on “Big Data” Tells Y…
Pili Hu's avatarPili Hu on Data News of the Week | Gender…
Pili Hu's avatarPili Hu on Key Takes from Jessica Lo…

Archives

  • August 2019
  • June 2019
  • April 2019
  • March 2019
  • February 2019
  • December 2018
  • November 2018
  • October 2018
  • September 2018
  • June 2018
  • May 2018
  • April 2018
  • March 2018
  • February 2018
  • January 2018
  • December 2017
  • November 2017
  • October 2017
  • September 2017
  • August 2017
  • May 2017
  • April 2017
  • March 2017
  • February 2017
  • January 2017
  • December 2016
  • November 2016
  • October 2016
  • September 2016
  • August 2016
  • July 2016
  • June 2016
  • May 2016
  • April 2016
  • March 2016
  • February 2016
  • January 2016
  • December 2015
  • November 2015

Categories

  • Announcement
  • Announcements
  • Article
  • Book
  • Colloquium
  • comment
  • Event
  • Field Trip
  • Gallery
  • general
  • news story
  • Open Lecture
  • Opinion
  • Resources
  • Tool
  • Tutorial
  • Uncategorized

Meta

  • Create account
  • Log in
  • Entries feed
  • Comments feed
  • WordPress.com

Recent Posts

  • A dossier of data journalism teaching strategies: Words from journalism educators worldwide
  • “中国数据可视化大赛”创作者专访:数据“几人行”
  • “Whoah, wait a minute, every reporter needs to be a data reporter”: Conversations with two generations of data journalists at the Los Angeles Times
  • Aaron Mendelson: Would numbers work with radio?
  • 首届“中国数据可视化大赛”启动-数据中的宏观和微观世界

Recent Comments

Unknown's avatarA quick video I made… on New towns fail to be self-cont…
Erin Chan's avatarErin Chan on Create Simple Filled Map (HK)…
National Congress: s… on “Big Data” Tells Y…
Pili Hu's avatarPili Hu on Data News of the Week | Gender…
Pili Hu's avatarPili Hu on Key Takes from Jessica Lo…

Archives

  • August 2019
  • June 2019
  • April 2019
  • March 2019
  • February 2019
  • December 2018
  • November 2018
  • October 2018
  • September 2018
  • June 2018
  • May 2018
  • April 2018
  • March 2018
  • February 2018
  • January 2018
  • December 2017
  • November 2017
  • October 2017
  • September 2017
  • August 2017
  • May 2017
  • April 2017
  • March 2017
  • February 2017
  • January 2017
  • December 2016
  • November 2016
  • October 2016
  • September 2016
  • August 2016
  • July 2016
  • June 2016
  • May 2016
  • April 2016
  • March 2016
  • February 2016
  • January 2016
  • December 2015
  • November 2015

Categories

  • Announcement
  • Announcements
  • Article
  • Book
  • Colloquium
  • comment
  • Event
  • Field Trip
  • Gallery
  • general
  • news story
  • Open Lecture
  • Opinion
  • Resources
  • Tool
  • Tutorial
  • Uncategorized

Meta

  • Create account
  • Log in
  • Entries feed
  • Comments feed
  • WordPress.com

Blog at WordPress.com.

  • Subscribe Subscribed
    • The Data & News Society
    • Already have a WordPress.com account? Log in now.
    • The Data & News Society
    • Subscribe Subscribed
    • Sign up
    • Log in
    • Report this content
    • View site in Reader
    • Manage subscriptions
    • Collapse this bar
 

Loading Comments...