Editing Mika/Temp/WikiFCD
From WikiDotMako
Latest revision | Your text | ||
Line 50: | Line 50: | ||
<!-- Please write your response below --> | <!-- Please write your response below --> | ||
Food composition data (FCD) are an essential part of nutrition research. FCD provide nutrient data for both processed (e.g. veggie burgers) and unprocessed (e.g. apples) foods. This information is often compiled by a governmental agency (e.g. the U.S. Department of Agriculture), based on lab measurements made by the agency or by companies. Many FCDs are available online, although they come in many different formats (e.g. PDF, CSV). | |||
The foods that are commonly consumed vary widely across regions. The nutrient content of unprocessed food can also vary for the same item, for a variety of reasons including changes in terroir. Area- and time-specific data are therefore key to understanding nutrition and health. Additionally, some countries lack these data or have only out-of-date data, leading to disparities in data and, ultimately, in the scientific evidence available for health research. | |||
Despite several past attempts by research institutes and intergovernmental agencies to create a global FCD, none has succeeded in developing one that is universally accessible, up to date, easy to use, and comprehensive. A wiki system has the potential to offer a better solution to this problem. We propose WikiFCD to compile nutrition data for unprocessed food that are already published and available online. These existing databases come from diverse settings, so compiling them becomes costly and difficult to maintain if the contributors are limited to a small set of researchers and employees in this field. The need for diverse participants is very much in line with the missions of projects supported by the Wikimedia Foundation, and we hope to show how peer production can help reduce knowledge disparities in global nutrition. | |||
===What is your solution to this problem?=== | ===What is your solution to this problem?=== | ||
Line 63: | Line 63: | ||
<!-- Please write your response below --> | <!-- Please write your response below --> | ||
# We will create a wikibase for food composition data from around the world. We will populate the wikibase with nutrition information. We will write schemas to describe our data model. We will map our properties to Wikidata properties. | |||
# Why is this a good idea? | |||
First, this database can contribute significantly to nutrition research: the wikibase system will improve the usability of FCD from different sources, help identify and borrow the most appropriate data for places where up-to-date FCD are not readily available, and open up new research questions that explore more nuanced nutrition data (e.g. changes in the nutrient content of the same product depending on the climate conditions of the year). | |||
Second, it is important to put these data into Wikibase because this dataset is relevant to people from many language communities. The design of Wikibase will allow us to more easily support additional languages in the data itself, as well as in user interfaces. | |||
By creating an instance of Wikibase for this project we will be able to design our own data models to incorporate data from heterogeneous data sources. If subsets of these data are appropriate for Wikidata, we will be able to provide machine-actionable ShEx schemas that will help us prepare data for other systems. In this way the data will be readily available for incorporation into Wikidata if desired. | |||
==Project goals== | ==Project goals== | ||
Line 86: | Line 77: | ||
<!-- Please write your response below --> | <!-- Please write your response below --> | ||
''' Goal #1: We will build | ''' Goal #1: We will build a wikibase where food composition data from around the world and from different time points can be entered, maintained, and retrieved. ''' | ||
''' Goal #2: We will | ''' Goal #2: We will involve participants from diverse communities to make sure that all available data are accommodated in this database.''' | ||
''' Goal #3: We will | ''' Goal #3: We will translate and link the data into other languages.''' | ||
== Project impact == | == Project impact == | ||
Line 104: | Line 95: | ||
<!-- Please write your response below --> | <!-- Please write your response below --> | ||
# Proof-of-concept | |||
## We will use 10 databases listed in [http://www.fao.org/infoods/infoods/tables-and-databases/en/ FAO/INFOODS] to test whether our schema can accommodate the varied information included in databases from different places. | |||
## Once the project is over, other databases not listed on FAO/INFOODS can be entered, following the examples we develop in this project. | |||
# Methodology | |||
## We will develop a tutorial and documentation for editathon participants to follow. | |||
## Once the project is over, these tutorials and documentation can be used by future participants to enter and maintain the database. | |||
# Alignment with WMF strategy | |||
One of the elements of Wikimedia’s strategy focuses on “Knowledge equity”, which includes “communities that have been left out by structures of power and privilege”. | |||
Supporting multiple language communities serves this purpose, as food composition databases are more common in English and languages spoken in the EU. | |||
=== Do you have any goals around participation or content? === | === Do you have any goals around participation or content? === | ||
Line 121: | Line 113: | ||
<!-- Please write your response below --> | <!-- Please write your response below --> | ||
# 50 participants covering at least 5 languages. | |||
# 50 participants covering at least | # 10 new data sources. | ||
# 10 | |||
==Project plan== | ==Project plan== | ||
Line 134: | Line 124: | ||
;System development | ;System development | ||
# Description - We will use the Docker image of Wikibase created by WMDE [https://github.com/wmde/wikibase-docker]. | # Description - We will use the Docker image of Wikibase created by WMDE [https://github.com/wmde/wikibase-docker]. We will use ShEx to express the schemas for our data models. We will use QuickStatements as well as custom bots developed using the WikidataIntegrator Python library to populate the Wikibase. | ||
# Outputs - | # Outputs - | ||
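As a rough illustration of the QuickStatements route mentioned above, the following minimal Python sketch turns one food record into QuickStatements V1 commands (tab-separated, one statement per line). The property IDs, the unit item, and the function name are all hypothetical: a new Wikibase assigns its own identifiers, so these would need to be replaced with the ones actually created in WikiFCD.

```python
from typing import Mapping

# ASSUMPTION: placeholder identifiers. A fresh Wikibase assigns its own
# P- and Q-numbers, so none of these are real WikiFCD IDs.
NUTRIENT_PROPERTIES = {
    "protein_g": "P10",  # hypothetical "protein content" property
    "fat_g": "P11",      # hypothetical "fat content" property
}
SOURCE_PROPERTY = "P20"  # hypothetical "stated in" style property
GRAM_UNIT_ITEM = "Q3"    # hypothetical item representing the unit gram


def to_quickstatements(label: str,
                       nutrients: Mapping[str, float],
                       source_item: str,
                       lang: str = "en") -> str:
    """Build QuickStatements V1 commands that create one food item.

    Each nutrient becomes a quantity statement with a unit, followed by
    a source (reference) pointing at the item for the original database.
    """
    # CREATE makes a new item; LAST refers to the item just created.
    lines = ["CREATE", f'LAST\tL{lang}\t"{label}"']
    # In V1 syntax a source uses "S" in place of the property's "P".
    source_key = "S" + SOURCE_PROPERTY[1:]
    for key, amount in nutrients.items():
        prop = NUTRIENT_PROPERTIES[key]
        # Quantity with unit is written as <amount>U<numeric item id>.
        value = f"{amount}U{GRAM_UNIT_ITEM[1:]}"
        lines.append(f"LAST\t{prop}\t{value}\t{source_key}\t{source_item}")
    return "\n".join(lines)


if __name__ == "__main__":
    print(to_quickstatements("apple, raw", {"protein_g": 0.3, "fat_g": 0.2}, "Q7"))
```

The resulting text could be pasted into a QuickStatements batch. A custom bot would be the better fit where records need de-duplication or updates rather than plain creation.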
;Editathon (Data entry and translation) | |||
# Description - The proof-of-concept databases will be entered into WikiFCD during five monthly Editathon events hosted in Seattle. | |||
# Outputs - FCD in Wikibase. | |||
;Editathon (Data entry | |||
# Description - | |||
;Documentation | ;Documentation | ||
#Description - The process of finding datasets, identifying meta-data (e.g. copyright, year of publication), entering data, translating data, and using data for analyses will be documented. | #Description - The process of finding datasets, identifying meta-data (e.g. copyright, year of publication), entering data, translating data, and using data for analyses will be documented. | ||
#Outputs - We will generate multiple ShEx schemas that will help us communicate our data model to stakeholders. We will write a tutorial for users of the system. We will write reusable federated SPARQL queries that demonstrate how to combine WikiFCD data with data from Wikidata. | #Outputs - We will generate multiple ShEx schemas that will help us communicate our data model to stakeholders. We will write a tutorial for users of the system. We will write reusable federated SPARQL queries that demonstrate how to combine WikiFCD data with data from Wikidata. | ||
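To give a sense of what such a federated query might look like, here is a minimal Python sketch that only builds the query text and does not send it anywhere. It assumes that each WikiFCD food item stores the matching Wikidata QID as a string under a mapping property. The base URL `https://wikifcd.example.org`, the property ID passed in, and the `fcdt:` prefix are hypothetical placeholders; only the Wikidata endpoint and entity namespace are real.

```python
# Public Wikidata SPARQL endpoint and entity namespace.
WIKIDATA_ENDPOINT = "https://query.wikidata.org/sparql"
WIKIDATA_ENTITY_NS = "http://www.wikidata.org/entity/"


def build_federated_query(mapping_property: str,
                          base_url: str = "https://wikifcd.example.org",
                          lang: str = "ja",
                          limit: int = 10) -> str:
    """Return a SPARQL query joining local food items to Wikidata labels.

    ASSUMPTIONS: base_url is a placeholder for the WikiFCD instance, and
    mapping_property is a hypothetical property holding the Wikidata QID
    of each food item as a plain string.
    """
    return f"""PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>
PREFIX fcdt: <{base_url}/prop/direct/>
SELECT ?food ?foodLabel ?wdItem ?wdLabel WHERE {{
  # Local part: food items that carry a Wikidata QID.
  ?food fcdt:{mapping_property} ?qid .
  ?food rdfs:label ?foodLabel .
  FILTER(LANG(?foodLabel) = "en")
  # Turn the QID string into a Wikidata entity IRI.
  BIND(IRI(CONCAT("{WIKIDATA_ENTITY_NS}", ?qid)) AS ?wdItem)
  # Federated part: fetch the label in another language from Wikidata.
  SERVICE <{WIKIDATA_ENDPOINT}> {{
    ?wdItem rdfs:label ?wdLabel .
    FILTER(LANG(?wdLabel) = "{lang}")
  }}
}}
LIMIT {limit}"""


if __name__ == "__main__":
    print(build_federated_query("P5"))
```

The query would be run against the WikiFCD query service, which then calls out to Wikidata through the `SERVICE` clause. Whether the local endpoint permits federation to Wikidata depends on how the query service is configured.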
;Communication | ;Communication | ||
#Description - Promotion of project outputs, feedback gathering, presentation at Wikimania and nutrition workshops, tutoring of interested volunteers | #Description - Promotion of project outputs, feedback gathering, presentation at Wikimania and nutrition workshops, tutoring of interested volunteers | ||
#Outputs - Blog posts, feedback reports, ShEx schemas | #Outputs - Blog posts, feedback reports, ShEx schemas | ||
;Project management | ;Project management | ||
#Description - We will report our progress twice in 12 months. | #Description - We will report our progress twice in 12 months. | ||
Line 167: | Line 140: | ||
{| class="wikitable" | {| class="wikitable" | ||
! WP/Month !! 1 !! 2 !! 3 !! 4 !! 5 !! 6 !! 7 !! 8 !! 9 !! 10 !! 11 !! 12 | ! WP/Month !! 1 !! 2 !! 3 !! 4 !! 5 !! 6 !! 7 !! 8 !! 9 !! 10 !! 11 !! 12 !! | ||
|- | |- | ||
| WP1 - System development || X || X || X || | | WP1 - System development || X || X || X || || || || || | ||
|- | |- | ||
| WP2 - | | WP2 - Editathon || || X || X || X || X || X || || | ||
|- | |- | ||
| WP3 - | | WP3 - Documentation || || || || || X || X || X || X | ||
|- | |- | ||
| WP4 - | | WP4 - Communication || || || X || X || X || X || X || X | ||
|- | |- | ||
| WP5 | | WP5 - Project management || X || X || X || X || X || X || X || X | ||
|} | |} | ||
Line 185: | Line 156: | ||
''How you will use the funds you are requesting? List bullet points for each expense. (You can create a table later if needed.) Don’t forget to include a total amount, and update this amount in the Probox at the top of your page too!''<br/><br/> | ''How you will use the funds you are requesting? List bullet points for each expense. (You can create a table later if needed.) Don’t forget to include a total amount, and update this amount in the Probox at the top of your page too!''<br/><br/> | ||
<!-- Please write your response below --> | <!-- Please write your response below --> | ||
Our budget includes the costs of two | Our budget includes the costs of two people involved with the project data models and software engineering, for the duration of the project. These positions will be filled by the grantees. The positions are split as follows: One person working on | ||
Our team includes a software engineer to ensure that the system we develop is performant. | |||
For dissemination, we plan to attend Wikimania to talk with editors in person about their needs and wishes for the tool. Wikimania is the right venue for this, as it brings together a large pool of editors from different Wikipedia language versions. | |||
To extend our outreach, we also plan to organize events, such as editathons, for communities that are under-resourced in Wikipedia. | |||
{| class="wikitable" | {| class="wikitable" | ||
! Item !! Budget | ! Item !! Budget | ||
|- | |- | ||
| Data scientist ( | | Data scientist (x hours per week for 8 months) || $ | ||
| | |||
| | |||
|- | |- | ||
| | | Software engineer (x hours per week for 8 months)|| $ | ||
|- | |- | ||
| | | Community outreach intern (x hours per week for 8 months) || $ | ||
|- | |- | ||
| | | Server hosting || $ | ||
|- | |- | ||
| Travel (Wikimania 2020, 2 people) || $ 4,000 | | Travel (Wikimania 2020, 2 people) || $ 4,000 | ||
|- | |- | ||
| Total || $ | | Total || $ | ||
|} | |} | ||
Line 212: | Line 184: | ||
<!-- Please write your response below --> | <!-- Please write your response below --> | ||
* Wikipedian communities in Seattle and in India | * Wikipedian communities in Seattle and in India | ||
* Academic nutrition communities | * Academic nutrition communities | ||
* We will host a workshop at Wikimania 2020. | * We will host a workshop at Wikimania 2020. | ||
* We will share our data models via ShEx schemas | * We will share our data models via ShEx schemas | ||
Line 223: | Line 195: | ||
''Please use this section to tell us more about who is working on this project. For each member of the team, please describe any project-related skills, experience, or other background you have that might help contribute to making this idea a success.''<br/><br/> | ''Please use this section to tell us more about who is working on this project. For each member of the team, please describe any project-related skills, experience, or other background you have that might help contribute to making this idea a success.''<br/><br/> | ||
<!-- Please write your response below --> | <!-- Please write your response below --> | ||
* Project manager | * Project manager (volunteer) - Mika Matsuzaki | ||
* Data scientist - | * Data scientist - Kat Thornton | ||
* Software Engineer- | * Software Engineer- Kenneth Seals-Nutt | ||
* | * Volunteer - | ||
* Volunteer - | |||
===Community notification=== | ===Community notification=== |