Jargonic Sets New SOTA for Japanese ASR

(aiola.ai)

19 points | by four_fifths 21 hours ago

3 comments

  • 1317 19 hours ago
    SOTA: not used in the article but probably State Of The Art

    ASR: Automatic Speech Recognition, speech-to-text

    • lenerdenator 19 hours ago
      And here I was, as a ham radio operator, excited to read something about Summits On The Air.

      shuffles dejectedly back to shack

  • rfv6723 19 hours ago
    Why no comparition to gpt-4o-transcribe?

    If you don't compare to latest model on the market, how can you claim it's SOTA?

    According to OpenAI, gpt-4o-transcribe has much better performance than whisper-large-v2.

    https://openai.com/index/introducing-our-next-generation-aud...

  • albertzeyer 19 hours ago
    Are there any details on what they changed to improve over other existing models?