GroceryDB helps shoppers spot ultra-processed foods

A 50,000-item database uncovers processing levels and healthier alternatives for smarter choices.

Study: Prevalence of processed foods in major US grocery stores. Image Credit: CandyRetriever/Shutterstock.com

In an article in Nature Food, researchers analyzed data from over 50,000 products across American grocery stores to create an open-source, detailed database of information on pricing, ingredients, and nutritional value.

This database highlights the utility of machine learning methods to support public health efforts, inform consumer choices, promote access to nutritional information, and improve dietary and health outcomes.

Background

The rise of ultra-processed foods (UPFs) has improved food availability and shelf life, but at a cost to health and sustainability. Evidence suggests that UPF consumption, which accounts for nearly 60% of calories consumed across developed countries, may increase the risk of non-communicable diseases such as metabolic syndrome, as well as exposure to harmful preservatives and pesticides.

This has shifted focus from food security to nutrition security, emphasizing access to affordable, healthy, and safe food. However, despite UPFs being widely consumed, determining the degree of processing is challenging due to inconsistent classification systems and ambiguous food labels.

Current methods lack reliability and reproducibility, leading to varied interpretations of UPFs' health risks. Researchers call for objective, biological, mechanism-based metrics to classify food processing accurately.

Artificial intelligence (AI) offers promising solutions by creating data-driven and objective tools. A recent development is the food processing (FPro) score, an automated index using machine learning to classify food processing based on nutrient profiles.

FPro leverages the NOVA classification system and other frameworks, providing reliable and scalable metrics to enhance nutrition security and support Sustainable Development Goals like zero hunger and improved health. This innovative approach could transform how we assess and manage food processing's impact on health.

About the study

Food data was gathered from online platforms of major American grocery chains. These stores categorize food items hierarchically, and this structure was standardized across the database. Nutrition information was sourced from product labels, converted to a uniform measure (per 100g), and analyzed using the FoodProX algorithm to assess food processing levels.
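The per-100 g conversion mentioned above can be illustrated with a minimal sketch. The function name, the nutrient keys, and the sample label values are hypothetical and chosen only for illustration; the study's own preprocessing code may differ.

```python
def per_100g(nutrients_per_serving, serving_size_g):
    """Scale label values given per serving to a per-100 g basis."""
    if serving_size_g <= 0:
        raise ValueError("serving size must be positive")
    factor = 100.0 / serving_size_g
    return {name: amount * factor for name, amount in nutrients_per_serving.items()}


# Hypothetical label: a 40 g serving with 4 g of protein and 12 g of sugar.
label = {"protein_g": 4.0, "sugar_g": 12.0}
normalized = per_100g(label, serving_size_g=40.0)
print(normalized)
```

Putting every product on the same basis is what allows nutrient profiles from differently sized packages and servings to be compared by a single model.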

This machine learning tool evaluates the nutrient changes caused by processing and assigns each product an FPro score ranging from 0 (unprocessed) to 1 (ultra-processed). The algorithm was trained on foods with known NOVA classifications and on validated effects of food processing on nutrients.

Ingredient lists were ranked by quantity, helping calculate the ingredient FPro (IgFPro) score, which links ingredient amounts to processing levels.

The database contains detailed food and ingredient data in spreadsheet files, providing FPro scores, nutritional information, and ingredient processing scores. A substitution algorithm suggests less processed alternatives to a given product by analyzing ingredient and food name similarities and sorting the suggestions by FPro score.
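A rough sketch of how such a substitution step could work is given below. It is not the authors' implementation: the equal weighting of name and ingredient similarity, the 0.3 threshold, the product names, and the ingredient lists are all assumptions made for illustration. The bread FPro values reuse the figures reported in the article.

```python
from difflib import SequenceMatcher


def name_similarity(a, b):
    """Character-level similarity between two product names (0 to 1)."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def ingredient_overlap(a, b):
    """Jaccard overlap between two ingredient lists (0 to 1)."""
    set_a, set_b = set(a), set(b)
    if not set_a or not set_b:
        return 0.0
    return len(set_a & set_b) / len(set_a | set_b)


def suggest_substitutes(target, catalog, min_similarity=0.3):
    """Return similar items with a lower FPro, least processed first."""
    suggestions = []
    for item in catalog:
        if item["name"] == target["name"] or item["fpro"] >= target["fpro"]:
            continue
        # Assumed scoring: equal weight on name and ingredient similarity.
        score = 0.5 * name_similarity(target["name"], item["name"]) \
            + 0.5 * ingredient_overlap(target["ingredients"], item["ingredients"])
        if score >= min_similarity:
            suggestions.append(item)
    return sorted(suggestions, key=lambda item: item["fpro"])


# Hypothetical products; only the bread FPro values come from the article.
target = {"name": "white sandwich bread", "fpro": 0.997,
          "ingredients": ["wheat flour", "water", "sugar", "yeast", "salt",
                          "resistant starch"]}
catalog = [
    {"name": "high fiber bread", "fpro": 0.732,
     "ingredients": ["wheat flour", "water", "yeast", "salt", "oat fiber"]},
    {"name": "tomato soup", "fpro": 0.800,
     "ingredients": ["tomato", "cream", "basil"]},
    {"name": "organic multigrain bread", "fpro": 0.314,
     "ingredients": ["whole wheat flour", "water", "yeast", "salt"]},
]
substitutes = suggest_substitutes(target, catalog)
print([item["name"] for item in substitutes])
```

Filtering on similarity first and sorting on FPro second keeps the suggestions within the same kind of food, so a shopper looking at bread is shown other breads rather than any low-scoring item in the store.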

Findings

The study utilized the FPro system to assess the processing levels of various food items by translating their nutritional content into a processing score.

For instance, organic multigrain bread, produced from whole grains without additives, had a low processing score (0.314), while more processed breads had higher scores (0.732 and 0.997, respectively) due to added fibers and starches.

Similarly, yogurts made from organic milk had a low processing score (0.355), while others with added sugars and additives had a higher score (0.918).

Analysis of processing levels across major grocery store chains revealed that ultra-processed items dominated store inventories. The balance varied by chain, however: some stores offered more minimally processed options, while others carried a higher proportion of ultra-processed foods.

Researchers also found variability in processing levels within food categories, like cereals and snack bars, indicating diverse consumer choices in some categories.

Additionally, a 10% increase in food processing was generally associated with an 8.7% decrease in the price per calorie, though this varied by food type. For example, highly processed soups were significantly cheaper per calorie than minimally processed ones. This highlights the complex relationship between food processing, cost, and consumer choices.
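To make the price relationship concrete, the arithmetic can be worked through in a short sketch. The product price and calorie count are hypothetical, and compounding the 8.7% drop over repeated 10% steps is an assumption made for illustration rather than a model taken from the study.

```python
def price_per_calorie(price_usd, calories):
    """Price of a product divided by the calories it provides."""
    if calories <= 0:
        raise ValueError("calories must be positive")
    return price_usd / calories


def price_after_processing_increase(base_price_per_cal, steps=1, drop_per_step=0.087):
    """Apply the reported 8.7% drop once per 10% increase in processing.

    Compounding over several steps is an assumption of this sketch.
    """
    return base_price_per_cal * (1 - drop_per_step) ** steps


# Hypothetical product: $4.00 for 800 kcal.
base = price_per_calorie(4.00, 800)
after = price_after_processing_increase(base, steps=1)
print(f"{base:.6f} -> {after:.6f} USD per kcal")
```

In this example the price falls from 0.005 to roughly 0.004565 USD per calorie after a single 10% increase in processing, which is the average effect the article reports across food types.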

Conclusions

This open-source database provides information and tools to analyze food processing and ingredient structures in the U.S. grocery market. Integrating large-scale food composition data and machine learning reveals varying levels of food processing across different grocery stores.

Factors like food costs, consumer socioeconomic status, and supermarket missions influence these differences. The platform highlights the link between food processing and affordability, with lower-income populations consuming more processed foods, which impacts nutrition security.

Governments increasingly recognize the health costs associated with processed foods, such as obesity-related medical expenses. This database offers insights into food processing levels, helping consumers make healthier choices by translating complex data into actionable scores.

Despite challenges in interpreting food labels, this system can guide better dietary decisions and public health strategies, like reorganizing store layouts.

The FPro algorithm currently evaluates food processing from nutrient concentrations. The researchers aim to improve it by incorporating more comprehensive ingredient data, which should enhance its reliability and its usefulness as consumer guidance.

Written by

Priyanjana Pramanik

Priyanjana Pramanik is a writer based in Kolkata, India, with an academic background in wildlife biology and economics. She has experience in teaching, science writing, and mangrove ecology. Priyanjana holds master's degrees in Wildlife Biology and Conservation (National Centre of Biological Sciences, 2022) and Economics (Tufts University, 2018). Between her master's degrees, she was a researcher in the field of public health policy, focusing on improving maternal and child health outcomes in South Asia. She is passionate about science communication and enabling biodiversity to thrive alongside people. The fieldwork for her second master's was in the mangrove forests of Eastern India, where she studied the complex relationships between humans, mangrove fauna, and seedling growth.

Citations

Please use one of the following formats to cite this article in your essay, paper or report:

  • APA

    Pramanik, Priyanjana. (2025, January 15). GroceryDB helps shoppers spot ultra-processed foods. News-Medical. Retrieved on January 15, 2025 from https://www.news-medical.net/news/20250115/GroceryDB-helps-shoppers-spot-ultra-processed-foods.aspx.

  • MLA

    Pramanik, Priyanjana. "GroceryDB helps shoppers spot ultra-processed foods". News-Medical. 15 January 2025. <https://www.news-medical.net/news/20250115/GroceryDB-helps-shoppers-spot-ultra-processed-foods.aspx>.

  • Chicago

    Pramanik, Priyanjana. "GroceryDB helps shoppers spot ultra-processed foods". News-Medical. https://www.news-medical.net/news/20250115/GroceryDB-helps-shoppers-spot-ultra-processed-foods.aspx. (accessed January 15, 2025).

  • Harvard

    Pramanik, Priyanjana. 2025. GroceryDB helps shoppers spot ultra-processed foods. News-Medical, viewed 15 January 2025, https://www.news-medical.net/news/20250115/GroceryDB-helps-shoppers-spot-ultra-processed-foods.aspx.
