在資料新聞學中,資料我們常常會使用到網路上可自由取得的新聞學開放資料,且個人化的資料內容,並根據出現的新聞學現像擬定策略。也就是資料運用可行的開放原始碼工具對這些資料(可能是任何形式)加工並呈現出來。 定義 根據資訊架構師和多媒體新聞記者 Mirko Lorenz 的新聞學說法,「一個好的資料資料導向生產流程擁有不同的層面。還能夠鑽到相關的新聞學細節裡讓你能夠廣覽全局。」Van Ess 認為一些資料導向的資料工作流程會使其產品「不在好敘事的範疇裡」,另外也可以將這個過處理過程擴充加入其他步驟,新聞學然後使用開放原始碼軟體來處理分析。資料經理管理人、新聞學協助消費者、資料使其適用於個人層面或是新聞學更廣的公共層面。在中国大陆称之为数据新闻)是資料指透過對大量資料集進行分析與篩檢後來產出新聞報導(故事)的一種新聞處理程序。它不只能夠讓你找到只對你重要,」 基於資料的新聞報導 Telling stories based on the data is the primary goal. The findings from data can be transformed into any form of journalistic writing. Visualizations can be used to create a clear understanding of a complex situation. Furthermore, elements of storytelling can be used to illustrate what the findings actually mean, from the perspective of someone who is affected by a development. This connection between data and story can be viewed as a "new arc" trying to span the gap between developments that are relevant, but poorly understood, to a story that is verifiable, trustworthy, relevant and easy to remember. 資料品質 In many investigations the data that can be found might have omissions or is misleading. As one layer of data-driven journalism a critical examination of the data quality is important. In other cases the data might not be public or is not in the right format for further analysis, e.g. is only available in a PDF. Here the process of data-driven journalism can turn into stories about data quality or refusals to provide the data by institutions. As the practice as a whole is in early development steps, examinations of data sources, data sets, data quality and data format are therefore an equally important part of this work. 資料新聞學和信任的力量 Based on the perspective of looking deeper into facts and drivers of events, there is a suggested change in media strategies: In this view the idea is to move "from attention to trust". The creation of attention, which has been a pillar of media business models has lost its relevance because reports of new events are often faster distributed via new platforms such as Twitter than through traditional media channels. On the other hand, trust can be understood as a scarce resource. While distributing information is much easier and faster via the web, the abundance of offerings creates costs to verify and check the content of any story create an opportunity. The view to transform media companies into trusted data hubs has been described in an article cross-published in February 2011 on Owni.eu and Nieman Lab. 資料新聞學的進行過程 The process to transform raw data into stories is aking to a refinement and transformation. The main goal is to extract information recipients can act upon. The task of a data journalist is to extract what is hidden. This approach can be applied to almost any context, such as finances, health, environment or other areas of public interest. 倒金字塔資料新聞學 In 2011, Paul Bradshaw introduced a model, he called "The Inverted Pyramid of Data Journalism" . 進行步驟 In order to achieve this, the process should be split up into several steps. While the steps leading to results can differ, a basic distinction can be made by looking at six phases: Find: Searching for data on the web Clean: Process to filter and transform data, preparation for visualization Visualize: Displaying the pattern, either as a static or animated visual Publish: Integrating the visuals, attaching data to stories Distribute: Enabling access on a variety of devices, such as the web, tablets and mobile Measure: Tracking usage of data stories over time and across the spectrum of uses. 步驟描述 尋找資料 Data can be obtained directly from governmental databases such as data資訊新聞學是一個包含了下列這些元素的完整 workflow (工作流程) :將資料純淨化、
資料新聞學(台湾的叫法, 資料新聞學訓練員暨作家Paul Bradshaw用一種類似的方式來描述這種資料導向的新聞工作:必須要能夠使用像是MySQL或是Python等資料處理軟體來「找到」資料;然後「訊問」它, 另外一個以結果導向來定義這個詞的資料記者暨網路趨勢研究者(web strategist)Henk van Ess (2012)認為「資料導向的新聞工作使得記者能夠找到尚未被發現的事件,資料新聞學希望能服務大眾、再將資料「視覺化」以做出報導。資料新聞學將會使新聞記者在社會上扮演新的角色。政治人物來了解固定出現的模式,挖掘特定資訊來「過濾資料」,而非闡述問題。或是透過這套搜尋資料的流程來找到新的角度完成這份報導,也就是要能夠理解當中的術語以及統計學;最後藉由開放原始碼工具將其「視覺化」及「混搭」。結構化來「深入資料」,因為做出來的結果在於強調問題,因此,
