Ars Electronica Archive
www.aec.at
Info| Contact| Disclaimer
German English
PRIX PIC PRINT PRIXMAP
Starts Prize WIMA
Talks ans Lektures Art&Science
AI Lab

ARS ELECTRONICA ARCHIVE - PRIX

The Prix Ars Electronica Showcase is a collection where all the artist submissions for the Prix since 1987 can be searched and viewed. The winning projects are documented with extensive information and audio-visual media. ALL other submissions are displayed with a basic metadata in list form.

Digital Humanity Award for Digital Humanity 2022

The Data Nutrition Project

Kasia Chmielinski, Sarah Newman, Matthew Taylor
(In alphabetical order)
340096Original: 12671_LT.mp4 | 1920 * 1080px | 10m 49s | 52.0 MB
Original: 1_DNP_pipeline.jpg | 2142 * 2142px | 481.8 KB | The model development pipeline | Data Nutrition Project
Original: 2_DNP_label_over_time.jpg | 1258 * 1258px | 512.8 KB | The Dataset Nutrition Label, over time | Data Nutrition Project
Original: 3_DNP_2022_DraftLabel.jpg | 1440 * 1440px | 773.8 KB | Preview of the Dataset Nutrition Label (version 3) to be released in late 2022 | Data Nutrition Project
Original: 4_DNP_2022_Detail_UseSection.jpg | 963 * 963px | 121.7 KB | Detail of above | Data Nutrition Project
Original: 5_DNP_2022_Detail_RisksSection.jpg | 997 * 997px | 363.4 KB | Detail of above | Data Nutrition Project
Original: 6_Kasia_SXSW.jpg | 2048 * 2048px | 297.5 KB | Kasia Chmielinski, DNP Project Lead, speaking on a panel at SXSW | Data Nutrition Project
    • CATALOG TEXT
    • CREDITS
    • BIOGRAPHY
    Artificial intelligence models and algorithms are increasingly used to make decisions about people, often leading to unintended consequences, particularly for communities that are already marginalized, underserved, and underrepresented. One cause of harm is the data that is used to train the models that make these decisions. Problematic, incomplete, or otherwise biased datasets used to train these models will replicate the very issues found in the training datasets.  

    The Data Nutrition Project  is an initiative run by technologists, artists, scholars, and practitioners to enable the quick evaluation and interrogation of datasets through educational practices and a tool: the Dataset Nutrition Label. Like a nutrition label for food, but including important information about the dataset, the Dataset Nutrition Label can help mitigate harm caused by using poorly chosen data for a particular use case. In addition to building labels and a forthcoming label-maker ingestion engine, the team also works on educational initiatives, and has a children's book and a podcast that are in the works. These initiatives are intended to drive cultural awareness about algorithmic risks, as well as current interventions to improve the AI ecosystem.  

    The Labels created by the Data Nutrition Project team, which are visual, interactive, and easy to digest, are intended for practitioners and researchers to have easy access to dataset contents. Too often "found data" is shared and reused with insufficient documentation: it either has no documentation at all, the documentation is incomplete, or the documentation is provided within the context of a domain’s epistemic norms, which may be unfamiliar to data practitioners. There is no agreement on standards when it comes to dataset quality or documentation, and this is exacerbated by the "move fast" culture endemic in the tech industry, that often prioritizes rapid deployment above all else. The Data Nutrition Project  seeks to address these challenges through a user-friendly tool (the Label) and through our creative-educational work. 

    Software 

    Our project uses two software components: (1) an Ingestion Engine, which is a logic-conditional questionnaire, currently in development, that will receive information necessary for Data Nutrition Labels from dataset owners and (2) the Dataset Nutrition Label, a user experience in which data practitioners can easily view important information about a dataset and the implications of using it, based on the information submitted by dataset owners. All our software is custom built and currently all our code is open source. Our ingestion engine and newest version of the Dataset Nutrition Label (V3) will be publicly released in late 2022.
    Data Nutrition Project team:  
    Project lead: Kasia Chmielinski  
    Research lead: Sarah Newman 
    Tech lead: Matt Taylor  
    Engineer: Kemi Thomas, HG King,  
    Data science advisor: Chris Kranzinger 
    Designer: Carine Teyrouz  
    Children's book illustrator: Michael Sherman 
    Data Nutrition Project Board: Jessica Fjeld, Mary Gray, Josh Josephs, and James Mickens 

    With previous support from: The Harvard Data Science Initiative at Harvard University, CR Digital Labs, The Assembly Fellowship at the Berkman Klein Center for Internet & Society at Harvard University, and The Miami Foundation
    Kasia Chmielinski (US) is Co-Founder of the Data Nutrition Project and a technologist focused on building responsible data systems across industry, academia, government, and non-profit domains. When not thinking about data, Kasia is usually cycling or birdwatching around the Northeastern US. Sarah Newman (US) is Co-Founder of the Data Nutrition Project and the Director of Art & Education at metaLAB at Harvard. Working at the intersection of research and art, Newman's work explores technology’s role in human experience. Newman is an avid napper and seashell collector. Matthew Taylor (US) is a learning experience designer with a background in AI. Previously worked at the MIT Media Lab, and in Boston Public Schools. Currently creating curricula to demystify AI, building mutual aid networks, and organizing tech workers for social justice. Seasoned pun specialist.
    Links: https://datanutrition.org/
    https://twitter.com/makedatahealthy
    Ars Electronica Linz GmbH & Co KG Ars-Electronica-Straße 1 4040 Linz Austria
    Tel. 0043.732.7272.0 Fax. 0043.732.7272.2 Email: info@ars.electronica.art
    All Rights Reserved, 2022
    Copyright