Editing Mika/Temp/WikiFCD
From WikiDotMako
Latest revision | Your text | ||
Line 52: | Line 52: | ||
[[File:Marketvegetables.jpg|thumb|200px]] | [[File:Marketvegetables.jpg|thumb|200px]] | ||
'''We propose a Wikibase instance, WikiFCD, to create a global nutrient database - or [https://en.wikipedia.org/wiki/Food_composition_data Food Composition Data (FCD)]. This wiki-based system will engage participants from diverse wiki communities to make this database universally accessible, up-to-date, and comprehensive.''' | |||
Food Composition Data (FCD) provide nutrient data for processed/cooked (e.g. veggie burger, hard-boiled eggs) and unprocessed (e.g. apples) foods. Research institutes and intergovernmental agencies have made several attempts to create a global FCD, but none has succeeded. Developing and maintaining such a database is difficult when contributors are limited to a small set of researchers and employees in this field. A wiki system offers a better solution to this problem. | |||
Many FCDs are available online, but they come in different formats (e.g. PDF, CSV) and vary in their level of detail. The nutrient content of an unprocessed food can also vary for the same item because of factors such as climate and terroir. Area- and time-specific data are often missing from current efforts to understand nutrition and health. Importantly, although commonly consumed foods vary widely by region, some places lack access to FCD that are regionally appropriate, up to date, or in their own languages. This leads to disparities in data availability and accessibility and, ultimately, in the scientific evidence available to health research. | |||
We propose a pilot project to write schemas that describe our data model, based on five large food composition datasets that are already available online. The need for diverse participants in this project is very much in line with the missions of projects supported by the Wikimedia Foundation. Through this pilot project, we hope to show how peer production can help reduce data and knowledge disparities in global nutrition. | |||
===What is your solution to this problem?=== | ===What is your solution to this problem?=== | ||
Line 64: | Line 68: | ||
'''1. What is the solution to this problem?''' | '''1. What is the solution to this problem?''' | ||
We will test several automated and manual methods to populate the wikibase with nutrient data from 5 food composition databases from around the world (see [[Mika/Temp/WikiFCD#Project_plan|Project Plan]] section for details). We will write schemas to describe our data model. We will map our properties to Wikidata properties. | We will test several automated and manual methods to populate the wikibase with nutrient data from 5 food composition databases from around the world (see [[Mika/Temp/WikiFCD#Project_plan|Project Plan]] section for details). We will write schemas to describe our data model. We will map our properties to Wikidata properties. | ||
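One automated import method could look roughly like the sketch below. It is only an illustration under stated assumptions: the column names, the local property IDs (<code>P1</code>, <code>P2</code>), the empty Wikidata mappings, and the sample values are all hypothetical placeholders, not part of any existing WikiFCD data model or real dataset. The sketch reads rows from a CSV export of a food composition database and turns each row into a list of statements keyed by a local property, which could later be mapped to Wikidata properties.

```python
import csv
import io

# Hypothetical mapping from source CSV columns to local WikiFCD properties.
# The IDs are placeholders; "wikidata" is left as None until a real
# mapping to a Wikidata property has been agreed on.
PROPERTY_MAP = {
    "protein_g": {"local": "P1", "wikidata": None, "unit": "gram"},
    "energy_kcal": {"local": "P2", "wikidata": None, "unit": "kilocalorie"},
}


def row_to_item(row, source):
    """Convert one CSV row into a simple item dictionary with statements."""
    statements = []
    for column, prop in PROPERTY_MAP.items():
        raw = (row.get(column) or "").strip()
        if not raw:
            # Missing values are skipped rather than stored as zero.
            continue
        statements.append({
            "property": prop["local"],
            "amount": float(raw),
            "unit": prop["unit"],
        })
    return {"label": row["food_name"], "source": source, "statements": statements}


def load_items(csv_text, source):
    """Parse CSV text and return one item dictionary per row."""
    reader = csv.DictReader(io.StringIO(csv_text))
    return [row_to_item(row, source) for row in reader]


# Illustrative sample only; the numbers are made up for this example.
SAMPLE_CSV = """food_name,protein_g,energy_kcal
Apple,0.3,52
Hard-boiled egg,12.6,
"""

if __name__ == "__main__":
    for item in load_items(SAMPLE_CSV, "example-fcd"):
        print(item["label"], item["statements"])
```

Keeping the column-to-property mapping in one table means a new data source only needs a new mapping, not new import code. Each resulting dictionary would still need to be written to the Wikibase through whichever import tool the project chooses.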
Line 73: | Line 73: | ||
'''2. Why is this a good idea?''' | '''2. Why is this a good idea?''' | ||
* First, this | * First, this wikibase system will significantly improve the usability of FCD from different sources for diverse users - from health-conscious individuals to academic researchers to public health workers. WikiProject Food and Drink on [https://en.wikipedia.org/wiki/Wikipedia:WikiProject_Food_and_drink English Wikipedia] and [https://www.wikidata.org/wiki/Q8485990#sitelinks-wikipedia its equivalents in other languages] are universally popular WikiProjects among editors; likewise, many articles on food and drink are within the top 10% of any Wikipedia's articles by pageviews. This new project can contribute to a topic that is of high interest to many people. | ||
: Building a structured dataset is also a key step in identifying the most appropriate data to borrow in resource-poor settings where up-to-date, detailed, and regionally appropriate FCD are not readily available. This new database will also open up new research questions that draw on more nuanced nutrition data (e.g. changes in the nutrient content of the same product depending on the climate conditions of the year), which could lead to substantial advances in nutrition and health research. | : Building a structured dataset is also a key step in identifying the most appropriate data to borrow in resource-poor settings where up-to-date, detailed, and regionally appropriate FCD are not readily available. This new database will also open up new research questions that draw on more nuanced nutrition data (e.g. changes in the nutrient content of the same product depending on the climate conditions of the year), which could lead to substantial advances in nutrition and health research. | ||
* Secondly, by creating an instance of Wikibase for this project, we will be able to design our own data models | * Secondly, by creating an instance of Wikibase for this project, we will be able to design our own data models to incorporate data from heterogeneous data sources. If subsets of the data are appropriate for Wikidata, we will be able to provide machine-actionable ShEx schemas that will help us prepare data for other systems. In this way, the data will be readily available for incorporation into Wikidata if desired. | ||
* Finally, we will | * Finally, we will complete this project with diverse communities from around the world as these FCD can be translated into/from many languages. The design of Wikibase will allow us to more easily support additional languages in the data itself, as well as in user interfaces. | ||
==Project goals== | ==Project goals== | ||
Line 121: | Line 121: | ||
<!-- Please write your response below --> | <!-- Please write your response below --> | ||
# 50 participants covering at least 3 languages. | |||
# 50 participants covering at least | |||
# 5 new data sources. | # 5 new data sources. | ||
Line 197: | Line 195: | ||
| Software engineer (10 hours per week for 8 months)|| $30x10x34 = $10,200 | | Software engineer (10 hours per week for 8 months)|| $30x10x34 = $10,200 | ||
|- | |- | ||
| Community outreach | | Community outreach intern (8 hours per week for 8 months) || $25x8x34 = $6,800 | ||
|- | |- | ||
| Server hosting (12 months) || $22x12 = $264 | | Server hosting (12 months at Johns Hopkins School of Public Health) || $22x12 = $264 | ||
|- | |- | ||
| | | Event costs || $1000 x 5 = $5,000 | ||
|- | |- | ||
| Travel (Wikimania 2020, 2 people) || $4,000 | | Travel (Wikimania 2020, 2 people) || $4,000 |