25 Years of Eggs

(john-rush.com)

50 points | by avyfain 3 days ago

6 comments

  • cheschire 50 minutes ago
    Absolutely loved the article, the process, and the results. Hated the price.

    You could pay a human to read receipts, 1 every 30 seconds (that’s slow!), $15/hr (twice the US federal minimum wage!), plus tax and overhead ($15x1.35) comes out to $20.25/hr over 5 hours. $101 all in.

    Sure, sure, a human solution doesn’t scale. But this sort of project makes me feel like we haven’t hit the industrialization moment that i thought we had quite yet.

    • wolfram74 10 minutes ago
      I mean, at over 1000% the cost, the machine solution doesn't scale either?
      • cheschire 3 minutes ago
        I think at a certain scale we're talking about switching to local trained models which don't have the same operating costs as running a frontier model for OCR. That would reduce the ongoing costs significantly. Might take longer than 30 seconds to read each receipt if you run multiple passes to ensure accuracy, but could run 24/7/365 without the same tax and administration overhead of humans.

        Spherical cows aside though, I do agree with you that I should not consider scalability as a given.

  • egeozcan 1 hour ago
    I usually avoid shallow comments but I feel like this time it has to be said as a conversation starter: That's a lot of eggs!

    Also ignoring the benefits of subscriptions, an estimate in the magnitude of thousands of dollars for extracting egg prices still makes me feel like we aren't "there" yet. This should have been a problem with a much more efficient solution given the advancements in the AI, data analysis and OCR space. I am sort of disillusioned.

    • sgbeal 1 hour ago
      > This should have been a problem with a much more efficient solution given the advancements in the AI, data analysis and OCR space.

      There's got to be a "it's a chicken/egg problem" joke in there somewhere, but i'm not seeing it.

      • egeozcan 2 minutes ago
        I actually was going to go for the "why did the chicken not cross the road?". Then I wanted to say "because it was in a price negotiation with the author to sell its eggs", but it was too wordy. Then I thought, "because the author had it as an egg before it could hatch", but it was too dark... Then I gave up.

        Well, I guess you cannot make a chicken joke without breaking some eggs (I'll stop now. I'm really sorry, but come on, it's Sunday).

    • f0cus10 36 minutes ago
      [dead]
  • tkgally 49 minutes ago
    I haven't tried it with receipts, but I've gotten excellent OCR results with Gemini 3.0 and now 3.1 on some challenging texts: handwritten letters I couldn't fully decipher myself, vertically printed Japanese texts with tiny furigana readings next to the kanji, a 19th century book in English with extensive use of italics and small caps. Gemini is good at extracting text and formatting from complex layouts, and it might work with egg receipts, too.
  • PowerElectronix 1 hour ago
    Inflation adjusted dsta just comes to tell us that either eggs have been outdoing the CPI for 25 years or that actual CPI is way higher than what the BLS calculates.
  • gib444 44 minutes ago
    > Estimated token cost $1,591

    I can assume this person does in fact NOT need to worry about the price of eggs ?

    • OJFord 24 minutes ago
      I think they worked that back from tokens used, hence the estimation, but their actual billing was Claude Code & Codex subscriptions. (Which probably was also the main contributor to it taking 14 days.)
  • sgbeal 1 hour ago
    > Estimated token cost $1,591 > Confirmed egg receipts 589 > Total egg spend captured $1,972 > Total eggs 8,604

    ...

    > I can’t wait to see what 30 years of eggs looks like.

    At $2.70 per receipt, i'd be in no hurry to find out!