Category: Opinion

Six-Hour with Geeks: A Glance into Hong Kong Open Source Movement

Loads of imagination about programming has been running helter-skelter in my mind before I step into the spacious and well-polished Spectrum studio on the 11th floor of an office building at Sheung Wan in this bright Saturday morning. As someone who has been concentrating only on courses about liberal arts since senior school, I always consider coding as something far away from my daily life.

But today, Chico Xu, Ivy Wang and I, as student reporters, are going to take a glance into this sophisticated business which we once thought was irrelevant to the lives of us and the lives of many, but which actually is, and to a large extent.

The event we are attending is called the Global Pandas Documentation Sprint, a worldwide event held simultaneously in more than 300 countries on March 10, 2018, aiming at improving this Python library’s documentation with clearer explanations and better examples, and trying to leave, at the end of the day, with the library enhanced “in a perfect state,” as put by its official website.

sites
The Global Pandas Documentation Sprint was held simultaneously in more than 300 countries around the world.

Continue reading “Six-Hour with Geeks: A Glance into Hong Kong Open Source Movement”

Data News of the Week: Hong Kong Legislative By-Election 2018

The hope for pro-democracy camp to regain its veto power in the legislature vanished as Edward Yiu Chung Yim failed to beat his rival Vincent Cheng Wing-shun in last week’s by-election.

Legislative By-election 2018 was held on 11 March for four vacant seats in the council, following after the oath-taking saga which disqualified six councilors. New legislators were elected from three regions – New Territories East, Kowloon West, and Hong Kong Island, as well as Architectural, Surveying, Planning and Landscape functional constituency.

Election is all about numbers – voter turnout rate, numbers of votes, and percentage of voters supporting candidate A or B, making it a golden opportunity for data visualization. In this article, we select three local news media, which are Initium, SCMP, and HK01, to discuss their different coverages on the by-election.

Continue reading “Data News of the Week: Hong Kong Legislative By-Election 2018”

Key Takes from Jessica Lo’s Sharing on ODD-HK 2018 about Government Data Portal

Open Data Day is an annual celebration of open data all over the world. In the year of 2018, more than 400 cities simultaneously organise hackathons on Mar 3. According to one Hong Kong organiser, Bastien Douglas, most local organisers of ODD are government affiliates. In Hong Kong, communities like OSHK and ODHK lead the organisation every year. One highlight for ODD-HK-18 is the talk from Jessica Lo, the system manager from OGCIO responsible for the open data portal: data.gov.hk

Continue reading “Key Takes from Jessica Lo’s Sharing on ODD-HK 2018 about Government Data Portal”

Lightning News from Public Data Sets

It is time to break-down the broad concept of “data journalism”. When talking about the combination of data and news, we usually refer to two processes, sometimes conducted in an integral manner. One process is to discover news points from datasets. The datasets can provide a lead for further investigation. The final product does not necessarily reflect the usage of data. It may look the same as normal news products mainly composed of interviews and photos. This is called “data mining” in the science domain. Another process is to present news points using data. There come to all kinds of charts and interactive/ immersive presentations. This is called “data visualisation” in the science domain.

Let’s focus on the “data mining” part in this article. That is to discover news from datasets, or more precisely discover a news lead from datasets. The further development of the entire news story may take much more efforts with a combination of traditional and modern methods. For easier discussion, we treat “news” in the general form: something the audience does not know before reading, a.k.a, something that “appears new”. It could be the status update of a current affair, or it could be the “new knowledge” to the readers (probably be “common knowledge” to experts which we don’t want to waste time debating).

As advocated by the “Road to Jan”: the most profound theory takes the simplest form. As a first step, we try not using programming, or even sophisticated spreadsheet skills. One can readily find some “news” with a bit “nose for news” and be computer literate is good enough. In this article, we will demo a few news points mined by our undergraduate students from Hong Kong government data portal: https://data.gov.hk . It took around 20 minutes in the second class of a data journalism course. We start with a public dataset from the portal, check out the data tables and eyeball if there is anything interesting. The process is so quick that we would like to give it a brand name: Lightning News. One can sharpen his/her news sense and data sense by doing this as daily exercise.

Continue reading “Lightning News from Public Data Sets”

[Repost] Reflections on the talk by Prof Ikhlaq Sidhu on Artificial Intelligence

dataXhkbu_2018masterclass_1

Prof Ikhlaq Sidhu on the DataXHKBU workshop on 26 Jan (by Xinzhi Zhang)

Note: This post was originally contributed as an entry to the Communicar Journal Blog on 28 Jan 2018. The author would like to repost it to the D&N Society]

Reflections on the talk by Prof Ikhlaq Sidhu on Artificial Intelligence

“Will AI help the film directors to make better movies? – Yes! But will AI replace the film directors and become directors? – No!” – Prof Ikhlaq Sidhu.

Continue reading “[Repost] Reflections on the talk by Prof Ikhlaq Sidhu on Artificial Intelligence”

Tech in IJ: An observation and rethink after GIJC17

The sharing Pili Hu gave about the experience, observations and thoughts taken away from the trip to Global Investigative Journalism Conference 2017. Slides are followed. Keep reading for an outline of topics discussed in this talk.

Continue reading “Tech in IJ: An observation and rethink after GIJC17”

Curate datasets for fun, and profit

Cover photo source: raul kalvo

The term “Curator” was traditionally used in the context of museums, library, gallery and art exhibitions. It general refers to the person who creatively plan and well organise resources to maximise the utility for the audience. The process that a curator gets the job done is thus “curate”.

TLDR readers, here are the sites that worth your visit:

Continue reading “Curate datasets for fun, and profit”

賣數據?賣能力?淺談數據新聞的商業出路 — 香港網絡媒體峯會後感

【編註】本文是2017年12月4日的香港網絡媒體峯會後的感想,由 Pili Hu 口述完成,先使用科大訊飛轉爲文字稿,再通過 OpenCC 由簡體轉爲繁體,經過輕量編輯後發出。因爲是口述稿,行文難免累贅,有部分用語不準確的地方,歡迎留言指正。從技術上看,是我們一次新的嘗試,希望藉此方法加速信息傳遞的效率。總共耗時爲1小時在地鐵上口述,加1.5小時編輯。

今天下午的網絡傳媒峯會。幾位主編都有分享,很多媒體運營的經驗。大家談論最多的話題還是媒體的商業模式。用端傳媒主編張潔平的話來說,媒體的商業模式主要有三種。第一種是廣告模式,即先用內容換取流量,然後再用流量去向廣告主收費。第二種模式是直接付費,即把內容直接賣給讀者,而最好的賣法是訂閱,相比點播,訂閱可以保證持續的現金流。第三種辦法,則是贊助,一些基金會或者個人金主,都會贊助大大小小的媒體。

香港網絡媒體高峯會2017@HKICC

Continue reading “賣數據?賣能力?淺談數據新聞的商業出路 — 香港網絡媒體峯會後感”

活用多個數據庫做企業背景調查:一篇民間調查的方法解析

原文:我猜你们一定很想了解一下红黄蓝

隨著北京紅黃藍幼兒園的虐童事件的不斷發酵,微博熱搜一度出現:「三種顏色不能上熱搜」熱門話題。中國社交媒體的兩大平台「微博」和「微信公眾號」的網絡大V們仍舊不斷在針對此次事件从不同角度發表各種文章。這次小編帶大家看看一個經常以吐槽科技公司及其產品的科技自媒體「差評君」,在第一時間發表了的一篇針對「紅黃藍幼兒園經營背景」的調查,獲得超過十萬加阅读量的微信公眾號文章。

這篇文章通過網絡公開資料,以熟練運用各種搜索工具作為主要手段,呈現了一個數據調查報道的成功案例。小編進行「逆向工程」,帶大家分析一下這篇文章中使用到的數據庫和調查手段。

Continue reading “活用多個數據庫做企業背景調查:一篇民間調查的方法解析”

Learn Spreadsheet to Mine Data and Jumpstart Your Data Journalism Career – A Sharing by Aimee Edmondson

Aimee Edmondson is now an Associate Professor with Scripps School of Journalism, Ohio University. HKBU students are very lucky to have this knowledgeable and passionate speaker to talk about data journalism this afternoon. Her 12 years in reporting and later acquired statistics and technology are a fine combination for a data journalist. In the world where people are too fascinated by new technology and numerous boot camps are created by non-journalists, Aimee can be a role model for those “traditional journalists” who are moving in this direction.

Why does data matter? In Aimee’s words, you want to be a reporter, not a repeater. Data helps one to verify what the source is saying and find out what is really happening. To be pragmatic, we are seeing more and more JD requiring data analytics skills from investigative reporters. Going beyond the journalism domain, the skills trained by data journalism can well fit into corporate communication, public relation and advertising industry.

Picture: Job boards on IRE, from the slides

To start, one only needs to work on “small data”, with a spreadsheet.

Continue reading “Learn Spreadsheet to Mine Data and Jumpstart Your Data Journalism Career – A Sharing by Aimee Edmondson”