La Nacion Data’s team portfolio from Buenos Aires, Argentina 2019

La Nación Data is the data journalism unit from LA NACION in Argentina. We do Open Data Journalism for change, in order to better serve our audiences using data to discover and tell stories. We promote the use of open data and access to information in Argentina as evidence to keep governments accountable, enhance transparency and allow citizen´s collaboration in the process of journalism.

Much of our work and effort is invested in building datasets from scratch, cleaning, structuring and normalizing to make them accessible for our newsroom and our users in Argentina. But our objective is also to produce long-term investigations that encompass a multiplicity of topics of interest with the intention of producing a political and social impact. Here is a selection of this year´s work and strategy:

1.  The Driver’s Notebooks of Corruption. This investigation formally began on January 8, 2018, when Diego Cabot, an investigative journalist from La Nación was given eight notebooks that exposed Argentina´s greatest corruption scandal ever. We built a structured dataset from this handwritten notebooks,  we transcribed all the information into an Excel database and began analyzing each one of the records. Then, we verified and checked the information by crossing it with other multiple sources such as official and company records, corporate information, public works contracts. This verification process was used to carry out a strict process of standardization and structuring of data in which currency and descriptions were unified and names and addresses were standardized.

So far, 73 individuals have been prosecuted in this case (40 have been imprisoned and several continue and another 90 businessmen are waiting for prosecution), including a former President, the entire office of the Ministry of Planning and the main public works contractors of the country. About 40 persons repented and confessed their crimes. US$ 600 million were seized.

 

2. 2018 Youth Olympic Games in Buenos Aires investigation we manually monitor all purchases, contracts and infrastructure of the event. After analyzing the historical budget, the electronic purchase portal and requesting twice an access to public information, we were able to reconstruct the total cost of the event that exceeds 8 billion pesos and explain specifically the purposes for such an expense. Until then, the government had repeatedly refused to inform this figure and did not have the systematized information open to citizens.

 

All processed information and original sources are made available to the public for download and reuse.

3. Project At What Price?. Together with Chequeado, a Fact checker unit, and Acij, a transparency NGO, we researched, compiled and opened all public contracts and data from drug purchase processes in State. It is an observatory of health-related purchases depending on the National Government and the City of Buenos Aires. Particularly, HIV medicines, contraceptives and vaccines.

We developed a scraper to get all the documents from multiple State portals, contrast them with the data obtained through requests for access to information, depurate them and accurately structure the base and carry out an in-depth analysis. This led to two unpublished investigations: a) only 25% of the investment for drug purchases corresponds to public tenders; b) the Government reduced the delivery of contraceptives.

 


 

4. Congresoscopio : legislative transparency 2010-2019.  To promote government accountability and use data as part of citizen control and surveillance we monitor the use of public resources and institutions.

A clear example is the Congresoscopio application that we use every time a new bill is voted in Congress and where we download the forms from the official sites of the Senate and House of Representatives. Through an OCR process, we convert them to Excel format and upload them then to our admin. It is a platform developed in house where the citizens access all the legislation considered in Congress from 2010 to the present and delve into how each legislator voted. In addition, it allows the user to see how many bills each legislator submitted and the number of absences. Obviously, legislation is made available to the user.

 

On the other hand, we believe that all citizens should participate by activating the demand for public information as a democratic principle and human right. That is why we are dedicated to teach civil society and build community. We strengthen our relationship with NGOs and universities through hackathons, meetups and workshops.

5. Argentina’s Senate rejected the bill to legalize abortionCitizens followed the presentations of Senators as closely as if they were watching a suspense movie, to be on the spot was very important. La Nación reused code to build a visualization that could follow that rhythm of the vote; it was important to know the final result of the vote and the vote of each one of the Senators on real time as well. The first interactive shows the faces of those persons representing us at the Senate, some of them are unknown for the audience in general. Moreover, the interactive adds the possibility of crossing variables to understand whether his/her age, religious belief, politic orientation or university background were related to their vote.

The second interactive, published early in the morning the day after, followed the same logic as the first one, but this time it analyzed and compared the votes between both House of Representatives and the Senate (257 Representatives/72 Senators), trying to find a pattern that answered the questions that many of us asked ourselves: did senior legislators vote against the bill and the younger ones, in favor?; do the Catholic ones supported the 2 lives? Are lawyers against it? You can find the answers in the visualization

6. Open Collaboration with NGOs and Universities: the Open Statements of Assets

An example of this is the open Statement of Assets (Affidavits) website of the main officers of government. Every year we access hundreds of affidavits through FOIA requests. The problem is that most of the data is handwritten on a sheet of paper, so we organized hackathons in alliance with NGOs: Poder Ciudadano, ACIJ and Directorio Legislativo to upload all the information into an open source administrator we created for such purpose.

 

These are meetings held for several days where students and volunteers participate to build this dataset together. In total, there are already more than 4,000 published affidavits typed in our unique web site,  that are constantly reused to investigate cases of corruption or to provide information on each new Government officer. This data is presented in open formats as well.

 


7. Data Science and Journalism: keeping our team and newsroom updated, we are now focused in learning how data science can help society and journalism for good, As this happens we support or co-organized the Global Women in Data Science (WIDS) Conference in Buenos Aires 2018 and 2019. In this event journalists, data scientists, engineers and programmers meet in order to promote the inclusion of women as speakers in Data Science in different areas, industries and services in Argentina in oder to inspire more women in STEM and in technology.

 

8. Open Data and Open Government:  we actively participate in movements of open information and act as users of open data from open governmens, we share learnings and give talks in several universities: Universidad Austral, Universidad Católica Argentina, Universidad Don Torcuato Di Tella, Universidad de La Matanza, Universidad de Lomas de Zamora, and others in Latin America as well.

 

Open Data Day

Every year, we are speakers at Argentina Abierta (Open Argentina) , a federal meeting on openness and innovation to design better public services. It is attended by public officers, entrepreneurs, civil society organizations, students and journalists willing to share achievements and experiences throughout the country and beyond to continue promoting a more open and collaborative world.

Team member at La Sabana University in Bogotá Colombia

At the same time, we were media partners of the 5th International Open Data Conference (IODC) in the City of Buenos Aires. It is a meeting aimed at developing transversal thoughts on openness, transparency and citizen participation. Our team was moderator of the panel on journalism and open data and organized a media conference at the editorial office of LA NACIÓN.

 


9. Automated content, from scratch in LA NACION: in search of innovation in data collection, analysis and presentation this year, in August, we began to develop and  produce automated content from scratch and in house. In other words, 1,300 automatic articles are monthly created on several topics like: sports, prices and inflation, currency exchange rates and more. The idea is to save work and make in-depth content creation more efficient to journalists and better serve with more topics on time based on data-to text stories, for our readers.

 

For example, as regards the monetary exchange rate, an automatic extraction of real-time data is made concerning the value of the Dollar and the Euro informed on Central Bank website and the subsequent publication of articles with more than 1000 combinations of journalistic articles. All these articles are accompanied by a visualization that is also updated through an automatic process.

10. Playing with data: regarding the innovative and creative presentation of data, the aim of all our interactive pieces is to challenge the user to understand, interact and explore the topic to be covered.

 

A clear example is the coverage of the 2018 World Cup in Russia where a series of visualizations focused on the real time of the event were planned. More than 12 large productions were created related to penalties in the knockout round, a balance of the first round, live statistics, an interactive key to beat the knockout round, players’ scores, fixtures and many more. All of them were created for direct interaction with the user: draw simulator, select your players, make your World Cup forecast.

 

11. Data visualization in news stories:  the first anniversary of the disappearance of the Ara San Juan Submarine. A year of uncertainty had passed since contact with the 44 crew members was lost. Data was collected from the last messages sent by the crew members to their families and the last reports sent by ARA SAN JUAN were presented.

In addition, the route was visualized, and a 3D render was made to graph the ventilation system through which the water that damaged the batteries entered and that ended the lives of all the crew members. At the same time, special emphasis was placed on real time searching and tracking with the use of tools such as marinetraffic.com from which all the stories with data were told and all the rescue ships followed and simulated in our graphs.

Another year learning new tools, producing data journalism and enjoying every challenge with our data team.

TEAM MEMBERS:

Momi Peralta Ramos: Data Journalism Project Manager and Multimedia Development Manager

Ricardo Brom: Data engineer for investigative reporting – Data intelligence unit

Florencia Coelho: Research and training in digital media trends

Gabriela Bouret: Data analyst and data miner

Romina Colman: Data journalist and freedom of information specialist

Bianca Pallaro: Data journalist and data producer

Gabriela Miño: Librarian and Archivist

Florencia Rodríguez Altube: Data journalist and data producer

Nicolás Bases: Back and front end developer

Cristian Bertelegni: Back and front-end developer

Sin comentarios

At what price?, behind the scenes of the investigation

 AT WHAT PRICE? : A PUBLIC PROCUREMENT OBSERVATORY 

In this post we propose to gather the step by step of At what price?, a public procurement observatory created by LA NACION Data in alliance with CHEQUEADO and ACIJ, which compiles the purchases of HIV drugs, contraceptives and vaccines made by the Health Office of the argentine Government and the city of Buenos Aires.

In this first approach we focused the work on the purchases of 2016 and 2017.

Seguir leyendo

Sin comentarios

El detrás de escena de la investigación sobre los Juegos Olímpicos de la Juventud

INVESTIGACIÓN: La Ciudad desembolsará más de $8000 millones en los JJ.OO. de la Juventud

La investigación se realizó en el lapso de tres meses. En junio comenzamos a realizar un relevamiento de todas las contrataciones vinculadas a los Juegos Olímpicos de la Juventud que se registraban en el sistema electrónico de Buenos Aires Compras (BAC). Esto lo complementamos con las cuatro bases de datos publicadas en el portal Buenos Aires Data: Convenios Marco, Órdenes de Compra cerrada y Órdenes de Compra Abierta.

Captura del sistema electrónico Buenos Aires Compra

Sin embargo, la información tenía registros repetidos entre las bases, la unidad ejecutora estaba mal descrita, la información estaba incompleta y desactualizada, la unidad monetaria no estaba unificada y no había rastro de aquellas compras que figuraban por fuera de la Unidad de Compras y Contrataciones de los Juegos Olímpicos de la Juventud (UPEJOL) y que estaban destinadas al evento.

Los datos descargados y normalizados del portal Buenos Aires Data

Tras un largo esfuerzo de normalización, decidimos contrastar los montos totales con el presupuesto sancionado y ejecutado de UPEJOL. Descargamos la base presupuestaria de Buenos Aires Data y filtramos según la unidad ejecutora correspondiente hasta obtener las cifras desde 2014 hasta la actualidad. Ahí vimos que las cifras de las compras y contrataciones estaban muy por debajo de esos montos y que la información que tanto trabajo nos había llevado recopilar estaba incompleta.

Es por esto, que decidimos realizar una solicitud de acceso a la información pública en el marco de la ley porteña 104. Solicitamos todas las contrataciones destinadas a los Juegos Olímpicos de la Juventud Buenos Aires 2018, desagregadas por renglones de las contrataciones y divididas según tipo de proceso, pliego, número de expediente, descripción, objeto, razón social, CUIT del proveedor, y precio total.

La aprobación del Pedido de Acceso a la Información Pública

Dentro del plazo de la solicitud que comprende la ley, la titular de UPEJOL, Florencia Grigera nos solicitó una cita en la sede del organismo para confeccionar la estructura del pedido. A las dos semanas de ese encuentro, nos solicitan que por favor vayamos a retirar la información a la sede (pese a reiterados pedidos de que nos lo envíe vía mail). Finalmente en el lugar, nos entregaron 9 hojas papel de tamaño A3. La información no sólo estaba en formato cerrado sino que tampoco cumplía con lo establecido en el pedido de acceso. Ante las negativas de entregar la información correcta y en xls o csv, nos contactamos con María Gracia Andía, quien dirige el Organo Garante del Derecho de Acceso a la Información. Ella intervino y logramos tener acceso a PDFs.

Algunas hojas A3 entregadas por UPEJOL

Una vez que logramos convertir esos PDFs a Excel, vimos que la información que habíamos recolectado del BAC era totalmente diversa a la que nos habían entregado. Un mismo expediente tenía montos y descripciones distintas. Es decir, una misma compra que aparecía en ambos archivos costaba diferente según los documentos y había sido adjudicada a empresas distintas. De ahí que nos contactamos con la Subsecretaría de Gestión Estratégica y Calidad Institucional del Gobierno de la Ciudad. Ellos no tenían conocimiento de estas disparidades y su pusieron en contacto con UPEJOL para resolverlas.

De ahí que decidieron realizar una reunión en el Comité Olímpico Argentino con la gente de la Dirección General de Calidad Institucional, el equipo de finanzas de los Juegos Olímpicos de la Juventud y nosotros. Ahí pudimos identificarles las inconsistencias y los errores en la sistematización de la información y en consecuencia se comprometieron a averiguar y corregir las faltas.

La información que nos envío el Director General de Calidad Institucional en formato abierto

Al lapso de tres semanas después de ese encuentro, recibimos un mail del Director General de Calidad Institucional con la información actualizada en formato abierto. Pero el archivo sólo contenía las compras y contrataciones de UPEJOL y la Ejecución Presupuestaria 2014-2018. Aún nos quedaba por corroborar las contrataciones vinculadas a la infraestructura Villa y Parque Olímpico que dependen de otros órganos. Para eso, decidimos descargar la información del portal de BA Obras del Gobierno de la Ciudad que tiene geolocalizados todos los proyectos más importantes de la Ciudad y que incluye información sobre las empresas adjudicadas, descripción de la obra y montos.

Captura del portal BA OBRAS

Es así cómo logramos reconstruir de forma manual el costo total de los Juegos Olímpicos de la Juventud que supera los 8.000 millones de pesos y explicar de forma detallada a qué específicamente va a parar todo ese dinero. Hasta entonces el Gobierno se había negado de forma reiterada a informar esa cifra y no tenía la información sistematizada abierta para los ciudadanos.

Además, con la información publicada en el presupuesto comparamos lo que ese monto representa. Es decir, pudimos ver que el gasto que se requiere para la organización supera la inversión total que exige construir la obra más cara y grande de la ciudad, la del Paseo del Bajo, que costará $6000 millones. Además, $8000 millones son cuatro veces más que el presupuesto anual de un hospital como el Ricardo Gutierrez y casi diez veces más que el del Moyano.

Parte del documento oficial de la postulación que el Gobierno de la Ciudad de Buenos Aires presentó ante el COI

Por último, también accedimos a un documento sobre lo que ellos habían pautado como gasto total en su postulación inicial como candidato para ser anfitrión de los Juegos. Allí estimaron un gasto de tan sólo 231 millones de dólares a un cotización de $4,50 y una inflación anual de 5,7% de cara a 2018.

Portal de legado olímpico

Con respecto al impacto, el Gobierno de la Ciudad nos informó que a partir de nuestra solicitud lograron solucionar los errores existentes entre organismos y pudieron sistematizar y concentrar la información. Es por eso que decidieron crear un portal de legado de los Juegos Olímpicos de la Juventud con la información recolectada y así convertirse en la primera ciudad en abrir un portal de transparencia sobre un evento olímpico. Logramos negociar que la publicación del portal sea posterior a la de la investigación. Si bien en la página no se informan los montos totales y la información se encuentra diseminada en distintas bases, al menos abrieron la información para que los vecinos puedan acceder por sí mismos. 

La investigación fue presentada en formato digital, papel y el programa televisivo de Lo que el día se llevó de LN+ conducido por Francisco Olivera.

 

 

 

Sin comentarios

LA NACION DATA Website

Meet the team!

>>> lanacion.com.ar/data <<<

Project Description

LA NACION DATA website is not just a website, it´s a strategy, a project and a team. The strategy has to do with the commitment of using data to tell stories and expand the use of data, preferably open data, to activate demand of public information, in a country that just passed a FOIA law.

The internal strategy is to work in teams with journalists, tv producers and infographers so we just facilitate the data and help in the investigation or analysis, but they are the ones who domain the topic and know how to tell the stories.  Seguir leyendo

Sin comentarios

How do we make information on salaries and retirement pensions of ex-officers available?

Date of publication

23/4/2017

Link to article

https://www.lanacion.com.ar/2015545-las-pensiones-de-los-ex-presidentes-un-beneficio-con-algunas-disparidades

Other links related

https://www.lanacion.com.ar/2078653-zaffaroni-el-ex-ministro-de-la-corte-suprema-con-la-pension-mas-alta

Description of your project

LA NACION Data team explores every day about government databases and great volumes of public documents apparently “lost” on the Argentine legislation. This work allows to find not only new sources of information but also data which become important sources for news stories that have never been told.  Seguir leyendo

Sin comentarios

Life insurance of the crew members of the ARA San Juan

 

Date of publication

7/3/2018

>>> Live Link <<<

Description of the project

After the disappearance of the ARA San Juan, we began researching at LA NACION the purchases made by the Argentine Navy for the maintenance of the submarine. Without any specific hypothesis as a starting point, we cared about understanding the general context of the disappearance of the ARA San Juan. As we went deeper on this matter, we researched on the SAR (Search and Rescue) Operation and the help provided by other countries.

However, we asked ourselves what part of the whole system we were not aware of, what elements we did not consider. So, we decided to request access to information in order to know about the life insurance of the crew members. The request was made on December 6, 2017 to the Ministry of Defense. Said request included a large number of questions. The Ministry began providing the answers at longer intervals as the internal institutions provided the documents.  Seguir leyendo

Sin comentarios

LA NACION (Argentina): Open Data Journalism for change

Meet the team!

Open Data Journalism for Change

Since 2011 when LA NACION Data was launched as an open data journalism initiative, its strategy has been the same: to do data journalism AND to open data.

The vision we have is that each set of data that is published means that more knowledge is released.

The first step to make datasets famous is to become open data USERS. And if there is no open data, no problem, we need to become PRODUCERS and transform from closed formats to open data. We have to find and convert public information, so we file FOIA requests, we use more than 8 OCR engines combined with processes that automatize the feed from that closed PDF to a database adding rows in a database per month or daily, and we even type from handwritten data files. 

We also host civic marathons for opening data. Beside publishing products and services, we attended every launch of a data portal hosted by government ministries or secretaries (Open Argentina ConfScience & Technology, Justice, Agroindustry, ACUMARPrisons, province of Buenos Aires, city of Bahía Blanca, city of Pilar), NGO (Education Project) or Universities Data Lab (Di Tellaand report about it in our blog.  Seguir leyendo

Sin comentarios

LA NACION DATA TEAM (ARGENTINA)

Project Description

Since 2011 LN Data has been a strategy, a content initiative and a team whose goals are to facilitate open data journalism in LA NACION and impact for change in Argentina.

Our team has been formed by the same initial three members, Ricardo, data engineer for investigative reporting in LN Data, Flor, research and training in digital media trends and open data and Momi Peralta. We grew from there year after year. Today we are 10, including  data miner Gaby, data producer Romina, two back and front end developers Nico and Cristian, a librarian and archivist Gabriela and two journalists with data and scripting skills Carolina and Bianca 2017.

But the truth is that  we are never just 10 because our strategy is not to be a special centralized investigative unit, but a platform for all our newsroom to facilitate data, innovation, technology and skills for data journalism crossing all platforms and sections including LA NACION in its Tv channel.

Seguir leyendo

Sin comentarios