All of human cooking compressed into 2 megabytes

(arxiv.org)

414 points | by josefchen 1 day ago

48 comments

  • epsteingpt 22 hours ago
    The work is very interesting. The title is misleading.

    A better title would be: "all of human ingredients compressed into 1,800 primitives"

    There is little to substantively nothing about the actual cooking: preparation methods, proportions, etc.

    But the idea that tomato goes well with beef the whole world over is very interesting and useful for creating flavors that will go together, perhaps surprisingly. It will be a nice resource in the future.

    • fps-hero 21 hours ago
      I have a wonderful book that explores this idea of an atlas of flavours that work together.

      The flavor bible.

      I can assure you that it does not contain 1800 ingredients in all of there combinations, but it does a remarkable job of covering a widely used selection of herbs spices vegetables and meats. I doubt a compressed version of the text would even be very large.

      The trouble I find with LLM generated recipes is they miss the nuance of the technique. Often the success of a depends on a single step or ratio. For instance “fried chicken” has a million incarnations the world over, but you can’t just average out the recipes and end up with tasty fried chicken.

      • FuriouslyAdrift 20 hours ago
        Ruhlsman's "Ratio" is also quite good at distilling the mechanics of food into an algorithm of sorts.

        https://www.simonandschuster.com/books/Ratio/Michael-Ruhlman...

      • theodorewiles 16 hours ago
        yeah I took a look at this and others and tried to pull together some helpful 'flavor maps':

        https://transcendent-choux-d1b930.netlify.app/

      • stuaxo 19 hours ago
        Sounds like the flavour theaurus
        • blitzar 19 hours ago
          Sounds like the flavour colour wheel.
          • xp84 17 hours ago
            Sounds like the flavour roulette wheel.
            • ian_j_butler 15 hours ago
              This is exactly like when Skippy is trying to work out the flavor profiles for the kinds of mush that monkeys will find most appealing. Yes, we know that "chocolate peanut butter bananas" is the true king of flavor latent space, but even a slight error in floating point precision could stick you with "cantor's peach butt nut abalone".
      • ZeWaka 18 hours ago
        Sounds like The Flavor Bible - it's a great reference book to find pairings of ingredients.
        • JadeNB 15 hours ago
          > Sounds like The Flavor Bible - it's a great reference book to find pairings of ingredients.

          I think fpshero says exactly that (https://news.ycombinator.com/item?id=48294459):

          > I have a wonderful book that explores this idea of an atlas of flavours that work together.

          > The flavor bible.

      • com2kid 18 hours ago
        > The trouble I find with LLM generated recipes is they miss the nuance of the technique. Often the success of a depends on a single step or ratio. For instance “fried chicken” has a million incarnations the world over, but you can’t just average out the recipes and end up with tasty fried chicken.

        Specify what technique you want. Explicitly say you want to correctly follow all the techniques of the chosen cuisine.

        All the LLMs have ingested nearly every cookbook ever made, across multiple languages.

        You can upload a photo of your spice rack (with visible labels) to ChatGPT and tell it to save your pantry ingredients as a memory.

        LLMs are absurdly overpowered for cooking, when used right. If you ask it for a week long meal prep plan the results will be meh, but ask it for kheer inspired rice crispy treats (which everyone reading this should to, kheer rice crispies are the best!) and you'll get some solid results.

        You may notice at first the LLM will still water things down for "American" tastes. With Claude/ChatGPT you only need to remind it once or twice not to do that and it'll course correct all future conversations.

        • dns_snek 16 hours ago
          > All the LLMs have ingested nearly every cookbook ever made, across multiple languages.

          That's not a positive thing, good recipe developers are Rare. For every recipe that's been meticulously tested and documented there are 1000 that haven't been. Many cookbooks are riddled with errors.

          • com2kid 9 hours ago
            Sure, but most recipe books are just copies of other good recipe books. There are only so many ways to bake cookies.

            I've always been a pretty good cook, but I've been able to pull off some really cool stuff with the help of ChatGPT lately. It is probably just an incremental lift, and I still catch it making errors from time to time, but it has been a huge help in the kitchen.

          • MengerSponge 11 hours ago
            Come on now how bad could it be? Wisdom of crowds and all that...

            https://www.bbc.com/news/articles/cd11gzejgz4o

            Oh. Oh no.

          • cindyllm 1 hour ago
            [dead]
      • jona-f 5 hours ago
        Obviously the llm has no idea how the recipe will taste. You will have to cook it many times yourself to really understand a dish. A good cookbook sets you up right away with a tasty start, but doesn't give you experience. A good cook has so much experience that they can actually out-of-domain predict a recipe from the taste, but an llm can't do that, cause it fundamentally has no idea how things taste to humans. They are a great starting point for recipes though and I'd argue they don't give you the average of all recipes, cause you can specify quite detailed which "authentic" version you want and the models generally to a decent job.
        • simulator5g 4 hours ago
          Besides that, taste is only a small part of the experience of eating something. You can't use an algorithm to cook good food if it has no reference to smell, texture, appearance, psychology, nutrition, culture, etc. You don't even really decide what to eat unless you put effort into the decision, your gut mostly drives you around via the vagus nerve. So a good recipe needs to appeal to your gut perhaps more than it appeals to you.
      • hansvm 7 hours ago
        Being able to wade through the bullshit is a valuable skill, and the worse your cooking is the less valuable an LLM will be. That said, prompting matters. Something like the following, chased with any necessary questions to ground the details at your skill level, is likely to get you an above-average result. Better cooking skills allow you to ask better follow-on questions, or (context windows matter) create new sessions probing other things you care about.

        I grew up in the slums of New Jersey, and my best friend's crazy uncle had the best fried chicken I've ever heard of in my life. I've eaten the best $10k omakase you can find and other amazing food the world over, and I can't even begin to describe what sets his chicken apart from literally every other food. How did he work that magic? You've been accoladed for your work uncovering the best, most unique flavors our civilization has to offer. Can you recreate that trip to heaven?

        Some generic follow-on questions (in line with the trope of "now make it better") include:

        1. That's all well and good, but I'm an experienced chef, and I know all of those elementary basics. Something is still missing. What made that meal the best in the world?

        2. Pick something the LLM said, and focus on that as if it successfully caught on to an important detail (e.g., in-context, IME you'd want to latch on to anything the LLM offered regarding buttermilk or fermentation).

        3. Take whatever you learned and start the fuck over. Use another context window to brainstorm a different, more appropriate persona if you can't come up with one on your own (the choice of New Jersey wasn't especially important -- just a concrete detail likely to elucidate ideas you won't see otherwise), and ask again in another session with a better persona and incorporating whatever you've learned and any inspiration you've taken.

        4. The initial question was a little open-ended. Ask the LLM to expand its results into 2-3 concrete, orthogonal directions capable of generating those experiences, fleshing out the details into full recipes.

        5. I'm sure the secret wasn't just the chicken. Drinks, sauces, music, and everything else played into it. How did he make that feel alive?

        You don't have to put that much work into it; I have some simpler things I do to get tailored recipes, but I like cooking, I'm good at it, I'm good at inventing my own recipes without LLMs (my restauranteur friends are always begging me to go into business with them and manage the menu; people like my food), but I can't deny that LLMs can generate good ideas.

        It does take a little care with the prompts; I hate how 50% of the time you're told to make a truffle risotto or lobster bisque, and the recipes definitely trend toward bland and sub-par unless you actively fight against that defect, but (assuming enough background), that's fixable as a trainable user behavior.

    • niek_pas 20 hours ago
      Unless I'm missing something, there's also nothing in the paper to indicate this is "all of human ingredients"? It looks like it's 11 data sources covering a bunch of common cuisines, with the English + Chinese sources accounting for 90% (!) of the dataset. Among others, Africa and the Arab world are not present in the data (good for about 25% of the global population).

      Also, all non-English terms were AI-translated to English which is methodologically understandable but surely leaves room for error.

      • order-matters 19 hours ago
        translation is an interesting problem in and of itself still. its kind of a miracle we can do it at all, yet in some circumstances it seems obvious for there to be objective answers (cooking ingredients being one of them), but even then you never really know even with human translators if you've got it correct. even within the same language nearly every individual has their own version of it.

        for example, how would you translate "chips" to another language without first knowing which version of English you are translating from? could be an american speaker with a british relative and they use the british definition of chips while otherwise mostly speaking american english.

        there's a level of pragmatism in translation that needs to be assumed, and ultimately we have to accept that translated knowledge will always have low resolution. There is a layer of work that needs to be done with the source of the materials involvement to get written content to a level of formalism needed to be representative of the language it is written in. Generally, the work of editors. Which means successful translation for wide distribution, while still not guaranteed, is predicated on the editorial skills of the translator which begs for dialogue with the source.

        Meanwhile, AI provides this super convenient band aid to get translation results you can't disprove.

        I genuinely think people are severely underestimating the power held by these models for being translators and how literal truth is going to be determined by them deep behind the scenes under the disguise of accessibility. Not in a dangerous way necessarily, just in a way where what languages are and what words mean is going to shift towards whatever the models think they are.

        In a way, over extended time, the models will not be wrong about the translations because their results will redefine what successful formal editing of language looks like, and disagreeing with them will amount to the same difference as having local slang.

      • dyauspitr 18 hours ago
        Leaving out Indian, Southeast Asian and Arab cuisine means this is nigh useless.
        • argee 17 hours ago
          There are 2,000+ varieties of mangoes alone. You could literally end up with a larger file using only mangoes.
          • zeroimpl 15 hours ago
            Was going to give the same example with chili peppers. Tons of varieties and not exactly interchangeable
            • buggymcbugfix 8 hours ago
              Thousands of cheeses, each of which is a unique experience. Heck, even the serving temperature completely alters the experience. Next: wines, charcuterie, ...

              Pity the fool who can't taste the difference between any of these.

          • momoschili 6 hours ago
            there are thousands of varieties of a lot of things though...
        • schemathings 6 hours ago
          Use ChaatGPT
        • teleforce 8 hours ago
          It's worst than useless, it's borderline criminal /s

          The fabricated title targeted the sensation rather than substance, typical scenario whenever "All" is in the title, and the worst when it's in the very first word.

    • Tade0 22 hours ago
      > But the idea that tomato goes well with beef the whole world over is very interesting

      I saved a beef stew I was making for twelve people once by adding tomato sauce.

      Beef hardens if stewed incorrectly and tomato acid tenderises it again.

      EDIT: removed incorrect information about store bought tomatoes.

      • foobarian 16 hours ago
        I have a hard time believing that that weak an acid can have a noticeable effect vs just the extra cooking time. Sorry for being a skeptic :-) Most of the time I deal with beef either it's the particular chunk of meat I bought that day or marinating it in salt for over a day or just stewing it for a long amount of time.
        • dylan604 13 hours ago
          Never doubt the power of a weak acid in the right use though. Storing tomato sauce in plastic food savers is a good example
    • CTDOCodebases 22 hours ago
      If you are interested in that you might want to check out this paper:

      https://www.nature.com/articles/srep00196

      • HappyPanacea 22 hours ago
        I would like one day to have a database which measure how strongly every food ingredient in use binds to every human smell receptors.
    • dbt00 19 hours ago
      Tomatoes are high in glutamate, which accentuates beef flavor.
    • cloverich 18 hours ago
      One that has long tickled me is cabbage +/- pickling. I eat both sauerkraut and kimchi from the jar and enjoy them as additions to _roughly_ the same foods, and when friends/family ask I insist they are basically the same thing anyways, but they are uninterested in such shenanigans. I'd love to learn more about these cross cultural shared foods.
    • Bengalilol 20 hours ago
      +1.

      On a side note (and maybe off topic), I am thinking about food pairing which is based on the idea that two ingredients sharing volatile aroma compounds or certain molecular families may have a potential sensory compatibility (broccolis and strawberries for example). I'd love to test those ingredients and find some unknown food pairings. But .. time is what it is (for now).

      • foobarian 16 hours ago
        Wouldn't it be great if we had a simulator like the MIT violin simulator [1] but for cooking ingredients! Then you don't have to throw out pounds of perfectly good ingredients just because broccoli doesn't go with Nutella.

        [1] https://news.mit.edu/2026/mit-engineers-virtual-violin-produ...

      • dheera 20 hours ago
        I think it's a lot simpler than that. A common pattern for sauces is fat + acid + salt

        - (Mexican) avocado and lime/lemon + salt

        - (Chinese-southwestern) chili oil and vinegar + salt/fermented bean paste

        - (Italian) olive oil and tomato + salt

        - (Turkish) olive oil and lemon + salt

        - (Thai) coconut milk and lime + salt

        ...

        • esafak 16 hours ago
          https://www.saltfatacidheat.com/

          This book was an eye opener for me. Obvious in retrospect; I wondered how I did not notice it myself.

          • SoleilAbsolu 16 hours ago
            I was gonna post the same, as a lifelong cook (and eater!) Samin Nosrat's book/show was essential for giving me the confidence to improvise in the kitchen while retaining authenticity to regional cuisines.
    • culi 15 hours ago
      It's also based on 11 sources across just 7 of the 7,150 human languages (English, Chinese, Russian, Vietnamese, Spanish, Turkish, Indonesian, German, and Indian-English)
    • cromka 18 hours ago
      > all of human ingredients

      Depending on who you ask, this may also sound misleading

    • bijowo1676 16 hours ago
      can anyone check whether pineapple goes well with baked dough with cheese/tomato sauce?
      • mlmonkey 16 hours ago
        only in Hawaii ...
    • neuroelectron 19 hours ago
      It's cheaper to train a robot how ingredients go together than to cook for humans.
  • leontrolski 23 hours ago
    Neat.

    I'm trying to compress recipes into little schematics https://leontrolski.github.io/recipes.html

    • teeray 22 hours ago
      I like it. Reminds me a bit of the table format on Cooking for Engineers (scroll to the bottom of the recipe): https://www.cookingforengineers.com/recipe/19/Erics-Chocolat...
    • michelb 22 hours ago
    • NiloCK 22 hours ago
      Ahh - the dependency graph recipe card. These are excellent. I've imagined something like this forever. Always annoyed that recipes put ingredients in a giant undifferentiated list and then give an instruction like "mix the dry ingredients in a deep bowl".

      For a while I expected there could be a good return on a good implementation of this, but now as soon as a strong interface itself is created it seems easy to copy.

      • whtrbt 20 hours ago
        I used to want this kind of recipe card, but I've cooked so much that's no longer the case (I actually forgot all about the idea until reading this comment). I can usually look at the list of ingredients, and imagine what needs to be done. If it's an unusual or unfamiliar cuisine I will read the method, but after that point the list of ingredients suffice. If I read a recipe somewhere and want to cook it later, I will just write the ingredients on a post-it (usually in cooking order) and maybe 1-2 brief comments.

        I imagine in domains you are skilled at you'd also prefer high level instructions than a step-by-step tutorial.

        I agree that doesn't help the beginner, or someone who doesn't cook regularly, or someone cooking something new and I think most recipe writers are just following the established structure without thinking about what they and others really need.

      • gorgoiler 22 hours ago
        ”To bake an apple pie from scratch, first you must create the universe.”

        — Carl Sagan

      • vrganj 21 hours ago
        Feels like one might be able to get an llm to convert an annoying to read recipe into a mermaid dependency graph following this example. Might be worth a try!
        • utopiah 20 hours ago
          Just tried https://www.mlynn.org/tools/generate-diagram and it didn't work.

          FWIW though most recipes are basically ~10 steps long so a simple list suffice.

          Still it could be an interesting experiment as I imagine that precisely recipes that are less sequential are (on average, with as challenging steps, e.g. excluding making caramel which has a high chance of burning) perceived as more complex.

    • duckmysick 19 hours ago
      That's a very cool site! I enjoyed the hand-drawn graphs.

      Your tables remind me of recipes in Modernist Cuisine. They all have ingredients grouped by the procedures together with weight, sometimes volume, and ratio.

      Example: https://modernistcuisine.com/wp-content/uploads/2013/01/Mac-...

    • mapipolo 22 hours ago
      I love this! I bet you could make a successful recipe book based on this concept, with large schematics that a cook can read from a distance while working in the kitchen.
    • infinite8s 12 hours ago
      This is like "Picture Cook: See, Make, Eat": https://www.amazon.com/Picture-Cook-Katie-Shelly/dp/16460416...
    • ekglimmer 17 hours ago
      This is cool - I tried automating different recipe formats/visualizations at https://www.gobsmacked.io. I was thinking of trying to do something similar to this next.
    • ultimatefan1 22 hours ago
    • Uncle_Brumpus 21 hours ago
      I really like these. I went through a phase a couple years ago where I got really into cooking new fancy recipes, and having to scroll around on recipe pages, or try and read my own chicken scratch notes or understand the context I was trying to imply when I wrote the notes weeks ago was a struggle. Having everything more or less right there in front of your face seems really nice.

      And I don't know why, but "Beans (green)" is really tickling my funny bone.

    • culi 15 hours ago
      Cute! I have a lot of handwritten notes. I've been waiting for a competent "paper scan to SVG" workflow to be developed so I can digitalize them properly. I think your project would also greatly benefit from something like that
    • karhuton 22 hours ago
      These are amazing. It feels so clear to see a visual ”map” of the cooking process before you even start.

      This would help coordinate two cooks to make prepping more independent.

      I’m trying to figure out if an landscape Ipad, with interactive elements for extra details if needed, would be a good UI for this.

      -

      Edit: Showed it to my non-Engineer wife and she said ”this is horrible” after staring at it for 10 seconds. Maybe not for everyone…

    • drbig 13 hours ago
      Thank you for sharing!

      This is inspiring. Cooking is this sort of an activity that is simple enough to not be overwhelming but also complex enough to be very interesting.

      Both in practice and in modelling :-)

    • goobatrooba 16 hours ago
      Nice ideas, it just misses amounts. And of course there may be nuances of technique missing, e.g. low or high heat. So it might work best as a recap and starting point "how to vary m favourite recipe".
    • dhx 20 hours ago
      This will perhaps be overkill for what you're trying to achieve, but there is Object Process Methodology (OPM)[1] which may be inspiration for modelling recipes. It looks similar to the methodology you're using, and as an example, expects the following type of modelling:

      "peeling carrot" (process) consumes "washed carrot" (object)

      "peeling carrot" (process) yields "peeled carrot" (object)

      "peeling carrot" (process) yields "carrot peel" (object)

      "finely dicing carrot" (process) consumes "peeled carrot" (object)

      "finely dicing carrot" (process) yields "finely diced carrot" (object)

      "prepare mirepoix" (process) consumes "finely diced carrot" (object)

      "prepare mirepoix" (process) consumes "finely diced celery" (object)

      "prepare mirepoix" (process) consumes "finely diced onion" (object)

      "prepare mirepoix" (process) consumes "butter" (object)

      "prepare mirepoix" (process) yields "mirepoix" (object)

      The advantage of OPM is alignment of graphical and textual representations.

      The downside of such approach is you soon discover how many millions of objects may exist in recipes--unwashed carrot, washed carrot, orange carrot, purple carrot, yellow carrot, white carrot, peeled carrot, coarsely diced carrot, finely diced carrot, julienned carrot... and purple julienned carrot vs. yellow julienned carrot. And that's just basic preparation complexity well before any contemplation of cooking or plating up elements. To go further you then discover a lack of useful labels such as "mirepoix" or "soffrito", if for example, you wanted to substitute sweet potato in place of carrot in the recipe.

      Then there is SysML 2[2] which is kind of like OPM if you ever wanted to write a recipe in 35,000 lines of code, including possibly all the complexity of mathematical modelling of the Maillard reaction for purple carrots vs. yellow carrots using either extra virgin olive oil or butter. Probably best suited for the largest food processing companies such as Nestle, Unilever, Modelez, etc and even then, inherent complexity of their food products rarely would reach the level of a fine dining dish prepared by a chef.

      [1] https://en.wikipedia.org/wiki/Object_Process_Methodology

      [2] https://www.omg.org/spec/SysML/2.0/Language/PDF

      • YeGoblynQueenne 16 hours ago
        OPM looks like a great way to implement a free-style crafting system for a roguelike. I mean instead of having standard recipes that you have to assemble you could invent your own by putting together basic ingredients but then you need a way to know what comes out (and its effects I guess). I don't know if any existing games do that.
        • dhx 11 hours ago
          Also have a look at Inform 7[1] (domain specific language for interactive fiction), specifically the "Stone" example section[2] which explains edible food and the effects different foods may have on the adventurer.

          It's similar concept to OPM with some modelling basics already built on top for player movement throughout a world, player interaction with objects (looking at, lifting, moving), and many other primitives needed to write interactive fiction. And relevant to this thread, Inform 7 of course has modelling basics for a player eating food, drinking potions, etc.

          [1] https://en.wikipedia.org/wiki/Inform#Inform_7_programming_la...

          [2] https://ganelson.github.io/inform-website/book/RB_9_1.html

        • ccozan 13 hours ago
          I mean Minecraft is quite similar in concept. Start with wood and stone and end up with an atomic bomb.

          Also useful to watch Dr. Stone anime.

    • eob 14 hours ago
      This is how I write down recipes for myself! There’s got to be some cognitive archetype that works better with this style encoding.
    • DoctorOetker 21 hours ago
      a lot of the schematics have avoidable edge crossings, that could improve intuitive readability enormously, theres entire subfields of graph theory that consideres rendering of graphs and planar embeddings.
    • felooboolooomba 21 hours ago
      Nice! I do similar but yours are much more sophisticated. I list the ingredients and then group them at the side with instructions.
    • InsideOutSanta 22 hours ago
      That's really neat and easy to parse, love it!
    • Bengalilol 20 hours ago
      Great initiative ! So simple and yet it works (from those recipes listed I know, they are all working nicely).
    • treis 20 hours ago
      My God what do you do to your feta?
      • k2enemy 14 hours ago
        Either crumble by hand or beat with an open end wrench.
    • hypercube33 13 hours ago
      The flowchart factorio of food preparation. Neat
    • severine 19 hours ago
      I love it too, bookmarked and hopefully adopted! Did you write about it somewhere?
    • chamomeal 21 hours ago
      Dang this is the best way to represent a recipe I've ever seen. I love it
    • danielvaughn 22 hours ago
      It's amazing how much more readable this format is. I love it.
    • damnitbuilds 20 hours ago
      I'd prefer it if you minimized the crossing paths :-)
    • globular-toast 13 hours ago
      I write down recipes similar to this. I realised one day it's very similar to how a makefile works: https://blog.gpkb.org/posts/cooking-with-make/
    • addedGone 21 hours ago
      Recipes-as-JSON?
    • leoh 17 hours ago
      This is really really awesome
    • vrganj 21 hours ago
      Now this I love. It respects the craft of cooking and the human element, while giving instructions in an easy to grok and straightforward way.

      Great job!

    • hkt 22 hours ago
      That is brilliant. Going to try some of yours then maybe transcribe my own favourites into the same format. You've struck on a great idea here.
  • utopiah 15 hours ago
  • coldtea 21 hours ago
    >from 11 sources spanning seven languages, English, Chinese, Russian, Vietnamese, Spanish, Turkish, Indonesian, German, and Indian-English

    So hardly "all of human cooking"...

    • bhouston 19 hours ago
      That does represent 70% or so of the current world population. It isn't 100% coverage but it is most of the many of the biggest groups.

      It is missing the Italian, Japanese, Greek and Mexican cooking - that are incredibly popular worldwide and it is incomplete without them, and nothing from Africa at all or Middle East.

      • scottlamb 18 hours ago
        > It is missing the Italian, Japanese, Greek and Mexican cooking - that are incredibly popular worldwide and it is incomplete without them, and nothing from Africa at all or Middle East.

        That's overstating it. There are certainly English-language sources describing Italian, Japanese, Greek, Mexican, African, and Middle Eastern recipes. They're likely not the most authoritative sources, but it's not as if I'd expect these cuisines to be completely absent.

        The actual corpuses they used are listed in the supplement: https://arxiv.org/src/2605.22391v1/anc/supplement.pdf

        edit: that document also breaks it out by region, including 33,923 Japanese recipes which seems respectable. 324 from Sub_Saharan_African which is tiny but still more than 0. Italian and Greek are likely a fair chunk of Mediterranean (164,107). I don't see a breakout for Middle Eastern. Some might be lumped into Mediterranean as well.

        • bhouston 11 hours ago
          Ah. So it was English language sources and not English cooking. Got it.
      • coldtea 11 hours ago
        >That does represent 70% or so of the current world population.

        Cooking is about different cuisines and recipes of important, not population count.

        Italy is "just" 60M people but has huge cuisine and big influence in global tastes. France too. Britain is similar sized, and Congo is 2x that, but none has much of a cuisine. Peru on the other hand, is half that, but a great cuisine.

    • AStrangeMorrow 21 hours ago
      Yes. I mean if you look at the corpus basically HALF of recipes are Chinese/Korean.

      They do quickly acknowledge it, but definitely not a balanced set.

  • dhx 21 hours ago
    See [1] for a demo, seemingly of an older iteration of what this paper describes. I was curious what ingredients the demo had selected (1032 available vs 1790 this paper selects) so I tried some obscure ingredients from "Organum: Nature, Texture, Intensity, Purity" by Peter Gilmore[2] (known for Quay restaurant in Sydney, Australia).

    It's got some adventurous ingredients such as juniper berry, macadamia nut, nigella seed, orange blossom water and lemon verbena. It even separates sesame oil and toasted sesame oil. Even though the ingredients list only has "rice", "black rice", "brown rice" and "glutinous rice", when you select "rice" as an ingredient, the recipes it generates are smart enough to advise of chilling cooked jasmine rice before using in a fried rice, and smart enough to soak and rinse Basmati rice before using in a pilaf. If selecting "lamb" as an ingredient, the recipes it generates will choose the cut as shoulder or shank if you select vegetables normally associated with braising.

    It doesn't know of grapeseed oil, orzo, mangosteen, lemon myrtle, and of course anything that only Peter Gilmore might use in a recipe and most chefs would have never heard of (karkalla as an example). I don't see this being too much of a limitation because such ingredients are quite localised or speciality. It knows of "pumpkin seeds" but not "pumpkin"--that is "squash", so there are some localisation improvements which could be made to improve British and American English use. I tried pairing "lamb" and "avocado" together in the hope it'd generate a recipe with a salad, but this failed. I then realised the ingredients list doesn't include lettuce or rocket, but has "salad greens" instead (American English) and no matter what I tried (other salad ingredients, chicken or no protein), it would not give me a salad. It kept generating wannabe-fancy dishes of a chunk of protein surrounded by tomato gel (agar agar) and a smear of avocado, or similar.

    [1] https://epicure.kaikaku.ai/

    [2] https://en.wikipedia.org/wiki/Peter_Gilmore_(chef)

    • BigTTYGothGF 18 hours ago
      > "pumpkin"--that is "squash", so there are some localisation improvements which could be made to improve British and American English use

      That's a much bigger issue than just wording differences. As an American, there's several different squashes in common use of which pumpkin is only one. (acorn, butternut, and spaghetti are the ones I'm thinking of; zucchini if you want to be pedantic).

      • dhx 11 hours ago
        Agreed. My comment was to highlight that if a recipe for a soup just says "1kg squash", that could mean anything from "Cucurbita maxima subsp. maxima var. Jarrahdale"[1] through to "Cucurbita pepo subsp. pepo var. recticollis"[2], with vastly different outcomes for the soup.

        The model under the hood should probably have ingredients as parts of a taxon, then have common names mapped (many:many) to these parts of taxons. Then it's necessary to have abstract classifications such as "pumpkin seed" which could be defined as the seed of multiple different taxons, which for some recipes, may not matter which one of 5 Cucurbita subspecies is used. That way if someone types "squash" or "pumpkin seed" they get asked to clarify what they mean, which will change quite a bit depending on locality of the person being asked.

        [1] https://en.wikipedia.org/wiki/Jarrahdale_pumpkin

        [2] https://en.wikipedia.org/wiki/Straightneck_squash

      • OJFord 13 hours ago
        In BrE too, not sure why that was related to regional difference if they're called pumpkins there too.
  • rldjbpin 1 hour ago
    clickbait aside, quite an interesting concept. i wonder if there is a word2vec moment with different ingredients/flavour profiles with these embeddings.

    as correctly pointed by others, this can be redone with a more representative data source, but looking forward to see the effectiveness of this approach.

  • bhouston 21 hours ago
    I saw this on X/Twitter. I do not believe that human cooking, and all of its techniques and ingredients and the various ways that things can be prepared in different cultural contexts can be compressed in to 2 megabytes.

    It is sort of like saying here is a 1GB model that can do tool calling and coding and then you try it out and it barely functions. Yes, it technically is a 1GB coding model, but it isn't a good one.

    • doug_durham 19 hours ago
      The space of palatable human food is small. There are only a few thousand ingredients and a few thousand preparation techniques. This could easily be compressed at high fidelity into a model.
      • bhouston 15 hours ago
        I say prove it. If it was a coding bot we would see some metrics on how it did. Have this thing produce a bunch of on demand recipes and judge it against human chefs. Have the people preparing the food cook according to its descriptions faithfully as either robots or non-cooking people so that their knowledge doesn't leak into the preparation. And then let's judge those recipes.

        IF you need experienced culturally knowledgeable chefs to prepare the food for it to work, then you haven't encoded all the techniques, just crib notes.

      • blovescoffee 11 hours ago
        There are only a “few thousand ingredients” seems like orders of magnitude off
        • boca_honey 9 hours ago
          I'm sorry to repeat myself but this bothers me because this is my specialization.

          There are less than 7 thousand ingredients [1]. Even if you think it's way more than that on account of underrepresented cultures (which does not seem to be the case in this particular study), it's still just a few thousand. Most cuisines use around 50 of them.

          [1] https://flowingdata.com/2018/09/18/cuisine-ingredients/

          The total number of cooking techniques should be in the hundreds (not thousands), to be generous, even accounting for historical / prehistorical techniques.

          So yeah, this checks out.

      • drpixie 6 hours ago
        Few thousand? Really? Larousse Gastronomique is 3600 pages, averaging maybe 1 recipe and 1 ingredient per page - and that's just classic French cooking.
      • mrhottakes 16 hours ago
        Citations needed
        • boca_honey 11 hours ago
          Less than 7 thousand ingredients [1]. Even if you think it's way more than that on account of underrepresented cultures (which does not seem to be the case), it's still just a few thousand. Most cuisines use around 50 of them.

          [1] https://flowingdata.com/2018/09/18/cuisine-ingredients/

          Total number of techniques should be in the hundreds (not thousands), to be generous, even accounting for historical / prehistorical techniques.

  • throwme_123 21 hours ago
    I would not trust a model/corpus about food that includes English and German, but excludes Italian and French
    • 0cf8612b2e1e 21 hours ago
      The table lists Italian, French and other cuisines. The 11 source listing is for language. The resource is missing authentic French-language described recipes, but there is surely an English-language French onion soup.
  • Retr0id 22 hours ago
    > [Claude] performed all ingredient classification under deterministic decoding (temperature 0–0.1)

    Not that it matters much in this context, but low-temperature is not the same thing as deterministic.

    • cubefox 22 hours ago
      Yep. Zero temperature is neither necessary nor sufficient for deterministic inference.
      • cj 22 hours ago
        Why?
        • tempay 22 hours ago
          You can seed the randomness are still having nonzero temperature.

          Numerical instability can introduce randomness especially on GPU like hardware unless you’re very careful about how you write your algorithms.

        • vessenes 20 hours ago
          In any batch inference environment that includes experts, expert routing may vary depending on what else is in the batch. For one thing.
  • arjie 17 hours ago
    I had Claude Code implement this for me with the data and information from there and it seems all right. Maybe it works well for substitutions rather than recipe construction: https://viz.roshangeorge.dev/recipe-model/
  • haaz 22 hours ago
    Published by Kaikaku, a London based startup doing automated restaurants and cooking
  • chupchap 10 hours ago
    There is such a large discrepancy in the number of recipes in the training data for South Asia vs other macro regions. I wonder what that impact that has on the study. I also wonder why there aren't mor recipes available for this region with such a deep culinary tradition.
  • mrhottakes 16 hours ago
    Similarly, all of human programming can be broken down into 128 characters
  • 21asdffdsa12 21 hours ago
    I forgot this to raise it in the last food related thread - so here is the after-wit: https://www.reddit.com/media?url=https%3A%2F%2Fpreview.redd....

    The triangle of flour - milk and egg- held eggnog, but eggnog contains alcohol, which is made of starches, usually flour.. thus being percentage-wise closer to flour then displayed. Yes, so much on the spectrum..

  • woile 13 hours ago
    I maintain a small website to share recipes: https://reciperium.com

    I wonder how could I apply this? I don't fully grasp what it is. Like what is the next best ingredient? Like, what could you add to it?

  • cuechan 21 hours ago
    I don't really understand, what the Graphs on page 9 and 13 represent, but they look somewhat like a world map with the continents. I wouldn't be surprised if there's actually a geographic connection. A lot of ingrediants are probably more prevalent in certain world regions.
  • nyokki 21 hours ago
    As someone learning to cook from recipes in multiple languages, this is really cool. Curious how it handles the same ingredient called by different names (e.g., "scallion" vs "green onion" vs "long onion").
  • BigTTYGothGF 18 hours ago
    Looking at Table 2 I wonder why Chinese and Korean are similar enough to go in one huge bucket (1.5M) but Japanese is distinct enough to get it's own tiny bucket (33K).
    • scythe 17 hours ago
      Two major contributing factors I can think of:

      - land meats were all but banned in Japan for centuries prior to Perry's ultimatum, encouraging the development of alternatives in flavor and nutrition like natto and katsuobushi

      - geographically, Japan had less access to land crops (even wheat was not common!) and more access to fish and seaweed than Korea

  • dolmen 13 hours ago
    This is about cooking but... no French?
  • vitto_gioda 21 hours ago
    Why haven’t you analyzed Italian recipes in Italian?
  • moffers 21 hours ago
    At uncook we’re in the middle of glamming up our ingredient normalization pipeline, so this is VERY welcome right now
  • foresto 17 hours ago
    This title evokes memories of The Twilight Zone's To Serve Man episode.
  • subscribed 20 hours ago
    Clickbait title, post should be removed.
  • skinfaxi 22 hours ago
    Cooking/recipes seems like it would be an excellent application for a specialized model.
  • talktalkmake 20 hours ago
    All of recorded human cooking compressed into 2 megabytes

    We've lost more than we know

  • vrganj 21 hours ago
    You can use it to browse flavor combinations here, seems quite neat!

    https://epicure.kaikaku.ai/

    That being said, I'm not excited about the idea of this being used to automate cooking somehow.

    Food, to me, is part of what makes us human, where we express our soul for lack of a better word.

    The idea of taking that away feels like robbing us of our humanity.

  • jnmandal 16 hours ago
    Title is pretty clickbaity. Cool data set though
  • sumeno 18 hours ago
    It makes me take the paper less seriously when one of the authors is submitting it with an inaccurate clickbait title.
  • zabzonk 20 hours ago
    WTF? I found this a parody. Perhaps because I've just been re-reading C.S. Lewis's "That Hideous Strength", where language becomes meaningless, I can't stand the design of the site or the text it presents.

    Making a nice lentil soup doesn't require any thought or description. I know that I, and millions of cooks in Asia will do it with just their hands.

  • prmph 19 hours ago
    "All of human cooking" how?

    Does it have African ingredients??

    Clickbait

  • selectedambient 18 hours ago
    this seems more like the very tip of the iceberg here v a complete database, anyhow very neat
  • rcarmo 20 hours ago
    It’s soup, right? Has got to be soup.
  • elzbardico 9 hours ago
    May we call it "To Serve Man"?
  • eob 14 hours ago
    These comments are so filled with cool projects and visualizations I’ll throw my own out there:

    Trying to generate non-slop, print-worthy recipe books for different diaspora communities.

    www.robotbookclub.com

    I’m pretty sure the recipes have surpassed slop. The photos are pretty close. The layouts, intros, and “chef photos” need a lot of work though.

    • dolmen 13 hours ago
      It is good enough to have much fun reading it. About chef photos, in Cook French in America, page 11: no eggs on a breakfast tartine.

      As a French, I would like the French version.

  • pfdietz 22 hours ago
    Cooking condensed beyond the point of usefulness.

    It's another book for Zach Weinersmith.

  • jweisbin 22 hours ago
    "human cooking"? ewww
    • jagged-chisel 21 hours ago
      To help you out, this is distinctly different from “cooking human”.
  • jonstewart 21 hours ago
    Jacques Pepin's knuckles don't compress.
  • NopIdoN 18 hours ago
    I just want say a week's worth of human cooking compressed into a pellet
    • esafak 16 hours ago
      Meal replacement powders.
  • h_a_n_k 16 hours ago
    this is the goat, the pinnacle of ai use, i'm all for it, haha. honestly one of my fave things i've seen
  • 1970-01-01 22 hours ago
    11 sources is not "all of" anything. You have a sample. The title is horrible. Fix the title please.
  • antirez 22 hours ago
    Odd not including French and Italian recipes.
    • TripleH 21 hours ago
      As soon as you start adding our beloved french recipes, frogs, snails and other oddities might substantially increase the 1,790 ingredients count
    • walthamstow 21 hours ago
      French and Italian languages. There are many recipes from both cuisines written in English which, I assume, will have been included.
      • antirez 21 hours ago
        Indeed, but I bet many were never translated.
    • Moosdijk 20 hours ago
      it's a clickbait title
  • suddenlybananas 22 hours ago
    I don't see why the title needs to be quite so grandiose.
    • muragekibicho 22 hours ago
      It's an appeal to the attention economy. "All of human cooking compressed into 2 MB" is(mentally) palatable relative to "Navigating the Emergent Geometry of Food Ingredient Embeddings".

      Getting you to click is the ultimate goal.

      • userbinator 10 hours ago
        The latter alternative might get even more clicks on HN.
      • subscribed 20 hours ago
        Should be downvoted/flagged and buried.
  • 6d6b73 17 hours ago
    "11 sources spanning seven languages, English, Chinese, Russian, Vietnamese, Spanish, Turkish, Indonesian, German, and Indian-English"

    1. English food is not really food, should not be included. :)

    2. 11 sources is nowhere near close to "all of human cooking"..

  • ekm2 14 hours ago
    All of human cooking

    With no African recipe.

  • baalimago 21 hours ago
    Great, so now chefs are being replaced too..!
  • system2 16 hours ago
    BS Title. 30 pages of summary is not all human cooking.
  • threwrfaway 14 hours ago
    [dead]