Tools:
recipe scrapers: https://github.com/hhursev/recipe-scrapers
NYT ingredient phrase tagger: https://github.com/NYTimes/ingredient-phrase-tagger
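To make concrete what an ingredient phrase tagger produces, here is a minimal sketch that splits a phrase such as "1 1/2 cups flour, sifted" into quantity, unit, name, and comment. It is not the NYT tagger's method: that project trains a statistical sequence model, whereas this sketch uses a regular expression and a small hard-coded unit list. The names `UNITS`, `parse_quantity`, and `split_phrase` are hypothetical and exist only for illustration.

```python
import re
from fractions import Fraction

# Hypothetical, deliberately small unit list (assumption for this sketch);
# a trained tagger would learn units from labeled data instead.
UNITS = {"cup", "cups", "tablespoon", "tablespoons", "teaspoon", "teaspoons",
         "g", "kg", "ml", "l", "oz", "pound", "pounds"}

# Optional leading quantity ("2", "1/2", "1 1/2", "0.5"), then the rest of the phrase.
PHRASE = re.compile(r"^\s*(?P<qty>\d+(?:\s+\d+/\d+|/\d+|\.\d+)?)?\s*(?P<rest>.*)$")


def parse_quantity(text):
    """Turn '1 1/2', '3/4', '2' or '0.5' into an exact Fraction."""
    return sum(Fraction(part) for part in text.split())


def split_phrase(phrase):
    """Split an ingredient phrase into qty, unit, name, and comment."""
    match = PHRASE.match(phrase)
    qty_text = match.group("qty")
    rest = match.group("rest").strip()
    # Text after the first comma is treated as a preparation comment.
    name, _, comment = rest.partition(",")
    words = name.split()
    unit = None
    if words and words[0].lower() in UNITS:
        unit = words[0].lower()
        words = words[1:]
    return {
        "qty": parse_quantity(qty_text) if qty_text else None,
        "unit": unit,
        "name": " ".join(words),
        "comment": comment.strip() or None,
    }


print(split_phrase("1 1/2 cups flour, sifted"))
```

A rule-based splitter like this breaks quickly on real-world phrases (ranges, parenthesized amounts, units missing from the list), which is the gap a trained tagger is meant to close.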
APIs:
ESHA Nutrition Database API: https://esha.com/products/nutrition-database-api/
ANSES-CIQUAL French food composition table, version 2020 (https://ciqual.anses.fr/#/cms/download/node/20): a table in Excel or XML format that breaks down various types of food (soups, desserts, dishes, vegetables, etc.) into their basic components (energy, water, protein, carbohydrate, etc.).
Get your recipe on Google: https://developers.google.com/search/docs/data-types/recipe#recipe-properties
The Edamam APIs, including the Nutrition Analysis, Food Database, and Recipe Search APIs (search over 2.3 million recipes by diet, calories, and nutrient ranges)
Spoonacular offers a variety of widgets for working with food data: an ingredient visualizer, which shows a recipe's ingredient list in a visually appealing way; a price visualizer, which shows the total estimated cost of a recipe; a taste visualizer, which displays the taste distribution of an ingredient or an entire recipe; a nutrition visualizer, which displays the macro- and micronutrients of a recipe; and an equipment visualizer, which shows the tools required to make a recipe. It also offers a free downloadable list of 'the 1000 most frequently used ingredients'.
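The recipe properties described in the Google guide listed above are schema.org Recipe properties, typically embedded in a page as JSON-LD. As a rough sketch of what that markup looks like, the snippet below builds a small Recipe object with the standard library. The helper names `recipe_jsonld` and `as_script_tag`, and all sample values (including the example.com URL), are placeholders rather than anything taken from the guide; check the guide for which properties are required or recommended.

```python
import json


def recipe_jsonld(name, image_urls, ingredients, steps):
    """Build a schema.org Recipe object and return it as a JSON-LD string."""
    data = {
        "@context": "https://schema.org/",
        "@type": "Recipe",
        "name": name,
        "image": list(image_urls),
        "recipeIngredient": list(ingredients),
        # Each step becomes a HowToStep item.
        "recipeInstructions": [
            {"@type": "HowToStep", "text": step} for step in steps
        ],
    }
    return json.dumps(data, indent=2, ensure_ascii=False)


def as_script_tag(jsonld):
    """Wrap JSON-LD in the script tag used to embed it in an HTML page."""
    # Escape "</" so recipe text cannot close the script tag early;
    # "\/" is a valid JSON escape for "/".
    safe = jsonld.replace("</", "<\\/")
    return '<script type="application/ld+json">\n' + safe + "\n</script>"


# Placeholder sample data, for illustration only.
markup = recipe_jsonld(
    name="Plain pancakes",
    image_urls=["https://example.com/pancakes.jpg"],
    ingredients=["1 1/2 cups flour", "2 eggs", "1 cup milk"],
    steps=["Whisk everything together.", "Fry in a hot pan."],
)
print(as_script_tag(markup))
```

Generating the markup from the same structured data that drives the page keeps the two from drifting apart.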
Data sets:
RecipeQA: A Dataset for Multimodal Comprehension of Cooking Recipes
Recipe1M+: A Dataset for Learning Cross-Modal Embeddings for Cooking Recipes and Food Images
Recipe5K: a dataset for ingredient recognition with 4,826 unique recipes, each composed of an image and the corresponding list of ingredients
YouCook2: one of the largest task-oriented instructional video datasets in the vision community
Food.com: crawled data from the Food.com (GeniusKitchen) online recipe aggregator
The Rakuten dataset with about 800,000 recipes and images
Ontologies & knowledge extraction:
...