Implementing a clear room Z80 / ZX Spectrum emulator with Claude Code

(antirez.com)

21 points | by antirez 1 day ago

6 comments

ralferoo 1 hour ago
The problem is that it will have been trained on multiple open source spectrum emulators. Even "don't access the internet" isn't going to help much if it can parrot someone else's emulator verbatim just from training.
Maybe a more sensible challenge would be to describe a system that hasn't previously been emulated before (or had an emulator source released publicly as far as you can tell from the internet) and then try it.
For fun, try using obscure CPUs giving it the same level of specification as you needed for this, or even try an imagined Z80-like but swapping the order of the bits in the encodings and different orderings for the ALU instructions and see how it manages it.
[-]
- PontifexMinimus 7 minutes ago
  > try using obscure CPUs
  Better still invent a CPU instruction set, and get it to write an emulator for that instruction set in C.
  Then invent a C-like HLL and get it to write a compiler from your HLL to your instruction set.
- dist-epoch 1 hour ago
  If you did that, comments would be "it's just a bit shuffle of the encodings, of course it can manage that, but how about we do totally random encodings..."
  [-]
  - ralferoo 56 minutes ago
    That's true, but I still think it'd be an interesting experiment to see how much it actually follows the specification vs how much it hallucinates by plagiarising from existing code.
    Probably bonus points for telling it that you're emulating the well known ZX Spectrum and then describe something entire different and see whether it just treats that name as an arbitrary label, or whether it significantly influences its code generation.
    But you're right of course, instruction decoding is a relatively small portion of a CPU that the differences would be quite limited if all the other details remained the same. That's why a completely hypothetical system is better.
airza 44 minutes ago
You use clean room everywhere in the article and clear room in the title. Is this on purpose?
[-]
- lazide 38 minutes ago
  Literally nothing about it is either, either.
  [-]
  - rustyhancock 23 minutes ago
    Yes for a moment I thought clear room might mean something else for LLMs.
    Essentially they can't do clean room anything!
    You might as well hire the entire former mid level of a businesses programming team and claim it's clean room work
    [-]
    - steve1977 21 minutes ago
      Windows NT is not VMS! Trust me!
      [-]
      - rustyhancock 16 minutes ago
        Had to Google this but I do love a deep cut reference!
        https://www.itprotoday.com/server-virtualization/windows-nt-...
marcus_lam 1 hour ago
The "clear room" framing is the most interesting part of this to me. It's essentially using the AI as a way to rebuild something from spec without looking at existing implementations, which has legal implications for emulation but also tells you something about the model's capabilities.
If Claude can produce a working Z80 emulator from documentation alone, it means the model has internalized enough about processor architecture, instruction sets, and timing behavior to reconstruct a functional implementation. That's qualitatively different from pattern-matching on existing emulator code.
From antirez's writing what comes through is that the value isn't "AI wrote my code" — it's that AI made it practical to attempt a project that would have been a multi-week time investment as a weekend experiment. The bottleneck for many side projects isn't knowledge or skill, it's calendar time. Compressing that changes what's worth attempting.
[-]
- lazide 38 minutes ago
  Except the AI was trained - by looking at the implementations? Or do you think Claude never saw the implementations in its training set?
  Because that is exceptionally unlikely.
avadodin 1 hour ago
So what you're saying is that it's not just the machine-readable documentation built over decades of the officially undocumented behavior of Z80 opcodes—often provided under restrictive licenses—it's also the "known techniques and patterns" of emulator code—often provided under restrictive licenses.
rjh29 1 hour ago
No Carmack or Stallman. Just the right person at the right time.
dist-epoch 1 hour ago
> I believe automatic programming to be already super-human, not in the sense it is currently capable of producing code that humans can’t produce, but in the concurrent usage of different programming languages, system programming techniques, DSP stuff, operating system tricks, math, and everything needed to reach the result in the most immediate way.
As HN likes to say, only a amateur vibe-coder could believe this.
[-]
- Zafira 1 hour ago
  It is really quite something how many people that have earned credibility designing well-loved tools seem to be true believers in the AI codswallop.