Al Jazeera Journalism Review

Outside image
(Laurence Dutton/Getty Images)

Investigative journalism in the digital age

One definition of investigative journalism – as given in UNESCO’s investigative journalism manual – says that it is about exposing the truth about public interest issues, issues whose details are kept under wraps (deliberately or otherwise) by the people involved. 

Using this definition, data and the stories extracted from it can constitute a central pillar of investigative journalism. Data constitutes the ‘raw material’ that a journalist uses to cast light into darkness, clear up ambiguities and solve apparent contradictions in their story. Investigative journalism and data-driven journalism have a big overlap. 

They both engage in in-depth research and sift through information to exclude any impurities (fake news or misleading data). Data thus plays an important role in the various stages of an investigation, including how it is presented within the story. 

You should state the importance of the data clearly within your report. You should also be careful to distinguish data from facts: data does not necessarily mean fact. There may be biases in the way data is collected, and you should always be careful to test your data and establish how it is linked to the incident you are investigating. 

When you first start working on an investigation, you should look up the data already available - whether official or unofficial - in order to answer as many of your questions as possible before moving on to posing questions whose answers are unknown and coming up with hypotheses and possibilities. 

A story begins with looking up data. You will need to have a way of collecting and presenting data, which is exactly what data-driven journalism provides. Data will help make your story measurable. It will allow you to render your "hows" as measurable "how muchs", allowing the reader to more clearly see the scope of the problem alongside the added value of any new information that you obtain from private sources of data or from data not initially included.

Let’s take a look at the different stages involved in data-driven journalism: 

Stage 1: Looking for sources 

First of all, review the available open source data relevant to the investigation, making sure – before doing anything else – that you are familiar with the frequency with which it is made available. You should also review any private, verifiable sources of data that you or your employer might gain access to. 

Open source data includes reports from the World Bank, the World Health Organisation (WHO) and the Food and Agriculture Organisation (FAO), annual government statistical reports, and social media websites.

Investigate 1
BEIJING: A reporter uses her laptop outside a press conference of the Chinese People's Political Consultative Congress, known as the CPPCC at the Great Hall of the People on March 2, 2011 in Beijing, China. [Feng Li/Getty Images]

Stage 2: Handling the data 

There are several database programmes that may be of use in handling data: 

• Microsoft Excel (spreadsheets) 

• OpenRefine (data refining) 

• Fusion Tables (verification) 

• MySQL (databases) 

• SOLR and Access (databases) 

There are several new techniques from data-driven journalism that may be of assistance in investigations: 

• Analysing data taken from the social media profiles of perpetrators or influential people. This can help you tease out lines of investigation or access information from non-traditional sources (Donald Trump’s tweets about a particular incident, for example, predating his presidency by many years). 

• Analysing audience reactions to prominent public issues. 

• Accessing historical data relevant to your investigation. For example, working out the dates on which something happened can provide you with new ways of understanding present problems (the date of a famine in a particular country with chronic water supply problems...) 

• Working out where something happened (a military operation in a particular country, for example) or a photo or video clip was taken. You can use data that has been deleted using archiving tools like Internet Archive or the Wayback Machine.

When refining large quantities of data, you can analyse and compare using a particular chronological or geographical filter. This can give your story new dimensions that may not have been immediately clear. If you go deep into the data, you may even find new stories.

In 2011, the Guardian was able to establish who was responsible for looting during rioting that had taken place across the UK in August 2011. The Reading the Riots project, conducted in cooperation with LSE, was heavily data-driven. 

iNVESTIGATE 2
LONDON: A local resident records a video with his mobile phone of burning barricades constructed by rioters in Goulton Road, Hackney on August 8, 2011 in London, England. Through its investigative reporting, the Guardian newspaper was able to identify many of those responsible for looting across the UK. [Dan Istitene/Getty Images]

The Panama Papers project drew on more than 11.5 million documents making up 2.6 terabytes of data dating from 1977 to 2015 and concerning about 214,000 corporate entities. 

The International Consortium of Investigative Journalists (ICIJ) incorporated the data into a database that makes sifting through it and searching it much easier. 

The Paradise Papers project, which likewise incorporates about 13.4 million documents obtained by the Suddeutsche Zeitung and showing how the world’s super-rich invest their money (ICIJ) 

Stage 3: Analysis 

After collating and refining the data, there are several methods you can use to analyse the data: 

• Descriptive analysis: answers the questions “what?”, “who? “, “how”, “where” and “when?” 

• Diagnostic analysis: answers the question “why?” 

• Advanced analysis making predictions about future scenarios. A successful example is provided by Noun Post’s report Golden Generals

investigate 3
PANAMA CITY: Part of the Panama City skyline is seen as revelations about the law firm Mossack Fonseca & Co continue to play out around the world on April 7, 2016. Millions of documents about the offshore activities of multiple corporate entities were leaked to, among others, investigative journalists. [Joe Raedle/Getty Images]

Stage 4: Preparing a data-driven investigation 

When putting together your story, there are various tools you can use to present data in way that is easier to understand: 

• Charts and graphs 

• Infographics 

• Interactive maps 

Tools that may be of interest include Tableau Public and Many Eyes, which will allow you to present data visually in a range of different ways, and Geocommons and Google Fusion Tables, which will allow you to produce maps using coordinates. The AJMI has produced a guidebook to data-driven journalism that provides detailed instructions on how to go about doing this.

Saving data and documents

There are various programmes you can use to store data: 

Google Drive: Google Drive is associated with your personal email. It can be used as a digital memory folder allowing you to save data. You can also work on it directly, whether through the Google Docs interface or through a Google Sheet (Excel). 

Xperia Companion: Download this programme to produce backup copies of your data. It allows you to transfer files easily from one device to another and store it safely. 

Dropbox: Dropbox allows you to keep your files safe in a cloud folder. You can then access them wherever you are in the world.

Verifying open source material

Traditional methods may seem like a better bet when trying to expose difficult facts, but the development of advanced techniques for gathering news from open source and user-generated content is playing an ever-bigger role in investigative journalism. 

In 2018, the BBC conducted an investigation in Cameroon which proved that contrary to what many had believed, government forces had been committing war crimes against civilians. The investigation took months of research and drew on a video clip taken with a mobile phone camera and published on social media showing armed men assaulting and then executing two women and two children. The video clip was verified and analysed scientifically. 

investigate 4
PEEL, ISLE OF MAN: A newspaper bill references the Paradise Papers outside a shop on November 7, 2017 in Peel, Isle of Man. The Isle of Man is a low-tax British Crown Dependency with a population of just 85 thousand, located in the Irish Sea off the west coast England. Recent revelations in the Paradise Papers, which were leaked to investigative journalists as well as others, linked the island to tax loopholes being used by Apple and Nike, as well as celebrities such as Formula One champion Lewis Hamilton. [Matt Cardy/Getty Images]

The armed men, the place where the incident took place and the type and source of the weapons used were all identified. By comparing Google Maps with the crime scene, the team were able to prove that not Boko Haram but government forces had carried out the executions, and that they had taken place not in Mali but in Cameroon. We will look at some of the details of the incident, and the digital tools used to analyse it, in more detail later on. In Sudan, BBC journalists were able to collect and review more than 300 videos shot by activists on the ground, allowing them to reconstruct a scene showing that the Rapid Response Forces had fired live ammunition on protesters in July 2019. 

In both of these cases, journalists were able to dispense with traditional methods and with teams on the ground while conducting their investigations. Thousands of videos shared on social media websites, carefully verified, were the deciding factor in the investigation. Fact-checking began with the investigation of fabricated photos or decontextualised video clips. 

But these techniques have got better and their use more sophisticated, creating a space for a new type of open source journalism. These modern techniques can be combined with traditional techniques to produce high-quality investigations. 

 

An earlier version of this article first appeared in the AJMI publication, Investigative Journalism Handbook

 

 

More Articles

"I Am Still Alive!": The Resilient Voices of Gaza's Journalists

The Israeli occupation has escalated from targeting journalists to intimidating and killing their families. Hisham Zaqqout, Al Jazeera's correspondent in Gaza talks about his experience covering the war and the delicate balance between family obligations and professional duty.

Hisham Zakkout Published on: 15 May, 2024
Under Fire: The Perilous Reality for Journalists in Gaza's War Zone

Journalists lack safety equipment and legal protection, highlighting the challenges faced by journalists in Gaza. While Israel denies responsibility for targeting journalists, the lack of international intervention leaves journalists in Gaza exposed to daily danger.

Linda Shalash
Linda Shalash Published on: 9 May, 2024
Elections and Misinformation – India Case Study

Realities are hidden behind memes and political satire in the battle for truth in the digital age. Explore how misinformation is influencing political decisions and impacting first-time voters, especially in India's 2024 elections, and how journalists fact-check and address fake news, revealing the true impact of misinformation and AI-generated content.

Safina
Safina Nabi Published on: 30 Apr, 2024
Amid Increasing Pressure, Journalists in India Practice More Self-Censorship

In a country where nearly 970 million people are participating in a crucial general election, the state of journalism in India is under scrutiny. Journalists face harassment, self-censorship, and attacks, especially under the current Modi-led government. Mainstream media also practices self-censorship to avoid repercussions. The future of journalism in India appears uncertain, but hope lies in the resilience of independent media outlets.

Hanan Zaffa
Hanan Zaffar, Jyoti Thakur Published on: 25 Apr, 2024
The Privilege and Burden of Conflict Reporting in Nigeria: Navigating the Emotional Toll

The internal struggle and moral dilemmas faced by a conflict reporter, as they grapple with the overwhelming nature of the tragedies they witness and the sense of helplessness in the face of such immense suffering. It ultimately underscores the vital role of conflict journalism in preserving historical memory and giving a voice to the voiceless.

Hauwa Shaffii Nuhu
Hauwa Shaffii Nuhu Published on: 17 Apr, 2024
Journalism in chains in Cameroon

Investigative journalists in Cameroon sometimes use treacherous means to navigate the numerous challenges that hamper the practice of their profession: the absence of the Freedom of Information Act, the criminalisation of press offenses, and the scare of the overly-broad anti-terrorism law.

Nalova Akua
Nalova Akua Published on: 12 Apr, 2024
The Perils of Journalism and the Rise of Citizen Media in Southeast Asia

Southeast Asia's media landscape is grim, with low rankings for internet and press freedom across the region. While citizen journalism has risen to fill the gaps, journalists - both professional and citizen - face significant risks due to government crackdowns and the collusion between tech companies and authorities to enable censorship and surveillance.

AJR Contributor Published on: 6 Apr, 2024
Silenced Voices: The Battle for Free Expression Amid India’s Farmer’s Protest

The Indian government's use of legal mechanisms to suppress dissenting voices and news reports raises questions about transparency and freedom of expression. The challenges faced by independent media in India indicate a broader narrative of controlling the narrative and stifling dissenting voices.

Suvrat Arora
Suvrat Arora Published on: 17 Mar, 2024
Targeting Truth: Assault on Female Journalists in Gaza

For female journalists in Palestine, celebrating international women's rights this year must take a backseat, as they continue facing the harsh realities of conflict. March 8th will carry little celebration for them, as they grapple with the severe risks of violence, mass displacement, and the vulnerability of abandonment amidst an ongoing humanitarian crisis. Their focus remains on bearing witness to human suffering and sharing stories of resilience from the frontlines, despite the personal dangers involved in their work.

Fatima Bashir
Fatima Bashir Published on: 14 Mar, 2024
A Woman's Journey Reporting on Pakistan's Thrilling Cholistan Desert Jeep Rally

A Woman's Voice in the Desert: Navigating the Spotlight

Anam Hussain
Anam Hussain Published on: 8 Mar, 2024
Breaking Barriers: The Rise of Citizen Journalists in India's Fight for Media Inclusion

Grassroots journalists from marginalized communities in India, including Dalits and Muslims, are challenging mainstream media narratives and bringing attention to underreported issues through digital outlets like The Mooknayak.

Hanan Zaffa
Hanan Zaffar, Jyoti Thakur Published on: 3 Mar, 2024
Why Journalists are Speaking out Against Western Media Bias in Reporting on Israel-Palestine

Over 1500 journalists from various US news organizations have signed an open letter criticizing the Western media's coverage of Israel's actions against Palestinians. They accuse newsrooms of dehumanizing rhetoric, bias, and the use of inflammatory language that reinforces stereotypes, lack of context, misinformation, biased language, and the focus on certain perspectives while diminishing others. They call for more accurate and critical coverage, the use of well-defined terms like "apartheid" and "ethnic cleansing," and the inclusion of Palestinian voices in reporting.

Belle de Jong journalist
Belle de Jong Published on: 26 Feb, 2024
Silenced Voices and Digital Resilience: The Case of Quds Network

Unrecognized journalists in conflict zones face serious risks to their safety and lack of support. The Quds Network, a Palestinian media outlet, has been targeted and censored, but they continue to report on the ground in Gaza. Recognition and support for independent journalists are crucial.

Yousef Abu Watfe يوسف أبو وطفة
Yousef Abu Watfeh Published on: 21 Feb, 2024
Artificial Intelligence's Potentials and Challenges in the African Media Landscape

How has the proliferation of Artificial Intelligence impacted newsroom operations, job security and regulation in the African media landscape? And how are journalists in Africa adapting to these changes?

Derick M
Derick Matsengarwodzi Published on: 18 Feb, 2024
Media Blackout on Imran Khan and PTI: Analysing Pakistan's Election Press Restrictions

Implications and response to media censorship and the deliberate absence of coverage for the popular former Prime Minister, Imran Khan, and his party, Pakistan Tehreek-e-Insaf (PTI), in the media during the 2024 elections in Pakistan.

Anam Hussain
Anam Hussain Published on: 14 Feb, 2024
Digital Battlegrounds: The New Broadcasting Bill and Independent Journalism in India

New legislation in India threatens the freedom of independent journalism. The draft Broadcasting Services (Regulation) Bill, 2023 grants the government extensive power to regulate and censor content, potentially suppressing news critical of government policies.

Safina
Safina Nabi Published on: 11 Feb, 2024
Pegasus Spyware: A Grave Threat to Journalists in Southeast Asia

The widespread deployment of spyware such as Pegasus in Southeast Asia, used by governments to target opposition leaders, activists, and journalists, presents significant challenges in countering digital surveillance. This is due to its clandestine operations and the political intricacies involved. The situation underscores the urgent need for international cooperation and heightened public awareness to address these human rights infringements.

AJR Contributor Published on: 5 Feb, 2024
Media Monopoly in Brazil: How Dominant Media Houses Control the Narrative and Stifle Criticism of Israel

An in-depth analysis exploring the concentration of media ownership in Brazil by large companies, and how this shapes public and political narratives, particularly by suppressing criticism of Israel.

Al Jazeera Logo
Rita Freire & Ahmad Al Zobi Published on: 1 Feb, 2024
Cameroonian Media Martyrs: The Intersection of Journalism and Activism

Experts and journalists in Cameroon disagree on the relationship between journalism and activism: some say journalism is activism; others think they are worlds apart, while another category says a “very thin” line separate both

Nalova Akua
Nalova Akua Published on: 28 Jan, 2024
Silent Suffering: The Impact of Sexual Harassment on African Newsrooms

Sexual harassment within newsrooms and the broader journalistic ecosystem is affecting the quality and integrity of journalistic work, ultimately impacting the organisation’s integrity and revenue.

Derick M
Derick Matsengarwodzi Published on: 23 Jan, 2024
Echos of Israeli Discourse in Latin American Media on Gaza

Heavily influenced by US and Israeli diplomatic efforts, Latin American media predominantly aligns with and amplifies the Israeli perspective. This divergence between political actions and media representation highlights the complex dynamics shaping Latin American coverage of the Gaza conflict.

Rita Freire Published on: 23 Nov, 2023
Why have opposition parties in India issued a boycott of 14 TV presenters?

Media workers in India argue that boycotts of individual journalists are not the answer to pro-Government reporting bias

Saurabh Sharma
Saurabh Sharma Published on: 23 Oct, 2023
The bombs raining down on Gaza from Israel are beyond scary, beyond crazy

REPORTER'S NOTEBOOK: As Israel bombarded Gaza for the third night, I found myself closer to a missile hit than I could have imagined

Maram
Maram Humaid Published on: 11 Oct, 2023
Reporter’s Notebook - what I learned from covering the Kalash people

As journalists, our fascination with Indigenous communities can blind us to our ethical obligations to respect privacy and dignity of those we document - we must reflect carefully

Anam Hussain
Anam Hussain Published on: 5 Oct, 2023