Alibaba to ban Claude Code in workplace over alleged backdoor risks, source says

(reuters.com)

193 points | by nsoonhui 4 hours ago

18 comments

avd201 2 minutes ago
Anthropic has been doing this sort of stuff for a while already. I mean, who remembers when Claude would just consume all your remaining usage if it read anything indicating that Openclaw had been used on your codebase? Because I remember. Two months ago btw https://news.ycombinator.com/item?id=47963204 Then there was the whole debacle of Fable silently downgrading to other models if it detected wrong think, or worse, outright sabotaging your codebase if you were working on language models lol
p0w3n3d 3 hours ago
```
  1. LOL I've just downloaded literally whole internet and copyrighted books and put them through a neural network. Now I have this whole knowledge in my LLM.

  2. Hey? Are you using my NN for training your NN? you're a thief!
```
[-]
- matheusmoreira 1 hour ago
  Remember how Kim Dotcom got destroyed for criminal copyright infringement? One would think the big tech CEOs would face the same fate, that police officers would rappel down helicopters, storm their mansions and bring them out in cuffs.
  Instead the AI companies reached these absurd settlements with publishers that made a mockery out of all the previous copyright enforcement victims.
  [-]
  - root-parent 1 hour ago
    Remember Aaron Swartz who did something that just pales compared to what Dario Amodei, Zuckerberg-Mr-Torrent and Sam Altman did.
    [-]
    - 314 55 minutes ago
      But Aaron Swartz did it for the benefit of other people. These fine people did it to uphold american values and enrich themselves at the expense of others. The law is clearly on their side.
      [-]
      - stingraycharles 31 minutes ago
        I think this comment is missing a /s, right?
      - dan_i 50 minutes ago
        Quick refresher on American values: https://en.wikipedia.org/wiki/Slavery_in_the_colonial_histor...
        [-]
        petcat 23 minutes ago
        Not sure what that is supposed to indicate? USA was a big place, even then. Most northern states had abolished slavery even before Britain, France, and especially Spain did. Maybe we should have a quick refresher on European values?
        zahlman 16 minutes ago
        I thought this thread was about Alibaba's internal policies. How did we get here?
        [-]
        dan_i 6 minutes ago
        [dead]
        batch12 28 minutes ago
        The article talks about European colonies, so would these have been European values then since America did not yet exist?
        [-]
        abenga 1 minute ago
        That's just the label that changed. Same people, same values.
        dan_i 26 minutes ago
        [dead]
    - matheusmoreira 1 hour ago
      Indeed I do. We should all remember him. Rest in peace.
  - datsci_est_2015 18 minutes ago
    The trick here, imo, was the integration with the military industrial complex. It wasn’t very difficult of course, as automation has been a topic in warfare for decades, if not centuries.
    But Eisenhower was right:
    > In the councils of government, we must guard against the acquisition of unwarranted influence, whether sought or unsought, by the military-industrial complex. The potential for the disastrous rise of misplaced power exists and will persist.
  - vlovich123 1 hour ago
    Reminds me, did the AI companies redistribute that copyrighted material to others and make their money that way? Did Kim use the copyrighted material to generate something novel from it?
    copyright law literally says something isn’t infringement if it is a novel transformation. I get the jokes and criticism about AI companies fighting and complaining about competitors distilling, but this is a much weirder comparison.
    [-]
    - root-parent 1 hour ago
      Now you just have to explain this:
      "Anthropic settles with authors in first-of-its-kind AI copyright infringement lawsuit" - https://www.npr.org/2025/09/05/nx-s1-5529404/anthropic-settl...
      [-]
      - hydrox24 5 minutes ago
        > "The training use was a fair use," [the judge] wrote. "The use of the books at issue to train Claude and its precursors was exceedingly transformative."
        > However, the judge ruled that Anthropic's use of millions of pirated books to build its models – books that websites such as Library Genesis (LibGen) and Pirate Library Mirror (PiLiMi) copied without getting the authors' consent or giving them compensation – was not.
        It seems clear from the article that while the use of pirated works was illegal, the use of copyrighted works (a the work a book is based on is still copyrighted if you buy the book) was fine and transformative.
    - andersonpico 34 minutes ago
      But distribution isn't the only crime here, obtaining the material illegally apparently is a crime too. And the damn robot can also spit me harry Potter verbatim so I don't know how it would also not be distribution?
      [-]
      - mapontosevenths 7 minutes ago
        If I read Harry Potter I will remember some parts verbatim. Others I will tecall in only an abridged and lossy way.
        Does that make my brain copyright infringement? Does Disney now own all my output forever because some small part of me now has Harry Potter embedded?
      - vlian2088 23 minutes ago
        >And the damn robot can also spit me harry Potter verbatim so I don't know how it would also not be distribution?
        if you prompt it to, yes. just like your browser dutifully navigates to any copyright-infringing resource and GETs and POSTs whatever you ask of it.
        (also it can't, not really, only small snippets before going off rails. LLMs aren't magic, they can't losslessly compress an exabyte of training data into a few terabytes of weights.)
    - cryptonym 29 minutes ago
      This is confusing. I can torrent everything and do what I want with it, as long as I don't redistribute the exact same thing?
      If so, why do we still pay for games and movies?
      [-]
      - john_strinlai 6 minutes ago
        >I can torrent everything and do what I want with it, as long as I don't redistribute the exact same thing?
        this is an incorrect interpretation (in the usa, at least).
        downloading a game/movie is still the creation of unauthorized copy, which is not allowed. not to mention that playing/watching does not count as a "novel transformation".
      - JsonDemWitOster 12 minutes ago
        IANAL (plus a whole suite of other caveats) but torrent-baiting works in Germany along these lines.
        ISPs and trigger-happy law firms don't send you a C&D for downloading a torrent, they do so for seeding a torrent. It's just that practically nobody "just seeds" a torrent so people colloquially claim they got busted for downloading a torrent.
        In theory this means if you torrent as a 100% leecher and turn off seeding from the get-go, you should be in the clear. But nobody sensible would dare test the extent of German Legal Spite, much less do so repeatedly to science the shit out of it.
        If you can download through another protocol, say HTTP, however---<Sendung unterbrochen!>
      - midasz 18 minutes ago
        I pay for games because it's more convenient than pirating them. For movies and tv however... They make it so difficult to be a customer.
  - codedokode 1 hour ago
    Exactly. If a rich corporation downloads and uses pirated content without paying, why should ordinary person pay for movies and music instead of downloading them for free?
  - Simulacra 45 minutes ago
    He just lost another court case… I wonder if we're getting close to the government spending as much to prosecute the man than what Hollywood possibly lost..
  - xienze 1 hour ago
    Remember how people used to justify their own personal software piracy with arguments like "information wants to be free", "no one stole anything, you still have the data", "I was never going to buy it anyway", and "copyright should be abolished?"
    > Instead the AI companies reached these absurd settlements with publishers that made a mockery out of all the previous copyright enforcement victims.
    Isn't that at least something? How many people pirating software ever settled with the companies they "victimized?"
    [-]
    - monooso 54 minutes ago
      How many people pirating software stole every piece of copyrighted material in existence and then used that material to generate billions of dollars which they kept for themselves?
    - cinntaile 44 minutes ago
      Settlements after the fact, not agreements beforehand.
      No that's not something. That's just having infinitely more money to fight legal battles.
      [-]
      - mapontosevenths 2 minutes ago
        When a crime is only punishable by fines it isn't a crime, it's just an activity with a tax.
        The AI companies knew that and bet, correctly, that it would be worth the cost.
    - matheusmoreira 1 hour ago
      > Remember how people used to justify their own personal software piracy
      A courtesy. There was never any need to justify it.
      > Isn't that at least something?
      Yes, it's a joke. Why do they get to infringe copyrights with impunity while normal people get destroyed? Either go after them like the copyright industry always does and punish them properly, or abolish copyright straight up. This "rules for thee but not for me" nonsense is straight up disgusting.
      > How many people pirating software ever settled with the companies they "victimized?"
      Too many to list. Also, nobody is victimizing billion dollar corporations.
      [-]
      - phoghed 55 minutes ago
        So you don’t actually care, you just want them punished out of spite because some other guy was for doing something similar but not the same?
        [-]
        matheusmoreira 20 minutes ago
        Correct. I'm one of the copyright abolitionists the other person alluded to. It's the selective enforcement that's disgusting.
        I mean, what is this? Their balls suddenly drop off? They only have the audacity to prosecute random people? Smaller companies? When they're up against trillion dollar AI companies they suddenly become cowards? That's so incredibly disgusting, and it made me completely lose even the small amount of respect for copyright that I had managed to rationalize over the years.
    - curtisblaine 12 minutes ago
      No. I want either:
      1. The copyright infringement of big corpos fully justifying my copyright infringement in the face of law
      2. The copyright infringement of big corpos being prosecuted in the same exact way as my copyright infringement would.
      There is really no middle ground.
- yubblegum 1 hour ago
  Whatever happened to honor among theives? What is this world coming to..
- short_sells_poo 2 hours ago
  The corollary is that there are no morals once the stakes are in the $ billions, let alone hundreds of billions.
  This isn't even about a single person or personality. Very few people in such position could stand fast by their moral code. In any case, an environment that favors profit above everything will naturally select for individuals who are unencumbered by such hindrances.
  There might've been 100s of Altmans and Amodeis who had a strong moral code but we don't know about them because they dropped out of the "race" because of said moral hurdles.
  [-]
  - rlpb 2 hours ago
    Copyright law is an artificial legal construct, not a moral code.
    I think appropriate attribution is a moral code, but I am not able to attribute every idea I have to all those who helped me develop the general intelligence that I use to develop such ideas.
    [-]
    - raxxorraxor 1 hour ago
      I think this behaviour has shown that there are no morals involved. Pirate if you want to, just don't get caught if you don't have a giant backing.
  - spinningslate 1 hour ago
    > an environment that favors profit above everything will naturally select for individuals who are unencumbered by such hindrances.
    Exactly. Dairy farms optimise for milk production so favour cows that produce the most milk.
    The market economy optimises for profit so favours those most willing/able to generate it. Zuckerberg, Musk, Thiel, Andreesen and co are products of the system.
- zobzu 33 minutes ago
  hn is now reddit. no substance.
- TZubiri 1 hour ago
  I never get tired of posting this answer because everyone on the internet is adopting this hot take:
  If you look at it with your eyes crossed, Anthropic and the chinese are doing the same thing.
  If you look at it with nuance 1 the chinese are doing way worse stuff, and 2 stealing from a thief would still be stealing
  1. The chinese are making multiple accounts (at least 49,000)[1][2], using proxies/VPNs, possibly using residential computers and infected computers (unless you think the chinese are doing due diligence to ensure their purchased IPs are kosher). All accounts need to be created with a real name, and especially so if the paid models need to be accessed and paid with a credit card. So this is beyond IP theft and getting closer to fraud. These are all techniques that are well studied because they are used by criminals and cybercriminals, textbook stuff. Consider if that was not sufficient, that China is banned from using the product, so they need to use identities and locations not just to avoid relating the accounts between themselves, but merely to allow account creation. What identities are they using to create accounts.
  Compare this to Anthropic which reads notes made a deal in an IP theft case paying billions because they bought books and scanned them but buying the books wasn't sufficient retribution for the authors. Or that they gasp scanned the internet, like Google.
  Not having nuance to see the difference between the two companies is something I expect of the twitter echo chamber copying hot takes for upvotes, not hacker news.
  [1] https://arstechnica.com/tech-policy/2026/06/anthropic-claims... [2] https://www.anthropic.com/news/detecting-and-preventing-dist...
  [-]
  - bildung 59 minutes ago
    What seems to be missing from that take is that a) Alibaba paid for the access b) there is no IP theft because LLM output is not copyrightable.
    Anthropic seems to want to both own and eat its stolen cake.
  - codedokode 54 minutes ago
    First, LLM is merely a tool and its output belong to whoever generated them. If a Chinese researcher used their creativity to generate a response, the copyright belongs to them and AI companies have no rights to it. Second, Chinese release many of their models for free, thus being on a noble mission to make AI available for every country (unlike certain company whose promises were nothing but words). For comparison, US companies do not release anything and want to keep AI for themselves and decide who gets to use it.
    > stealing from a thief would still be stealing
    Stealing from a thief hurts thief industry which is a win for society.
    > The chinese are making multiple accounts
    Not a crime. AI companies also ignore robots.txt and applicable laws when illegally copying copyrighted material from websites to their servers without author permission.
  - xpct 6 minutes ago
    Let's not sane-wash Anthropic's book theft. No, they didn't just 'scan' the internet, they created a tool for worldwide license washing and got fined an insignificant amount for it.
bhouston 2 hours ago
All remote AI are a massive security risk for individuals/companies/governments that may be targeted by the US government.
It is likely that the US will get a live feed from each AI provider that they are inspecting in real time to identity things of interest, terrorist attacks or foreign government planning or even foreign companies competitive to key US companies.
It will give them access to the though process in those companies as well as much of their text-based IP (source code, docs, meeting transcripts, etc)
Also if you are using local AI that you didn’t train yourself you can never be sure it doesn’t have purposeful biases in its reasoning that may disadvantage you - such as directing you away from certain plans or ideas or patents etc.
[-]
- londons_explore 1 hour ago
  It is worth thinking about the fact the total throughput of even a big LLM provider isn't many megabits.
  If a token compresses to around a byte, worldwide AI input and output is around 1 gigabyte per second.
  For any intelligence agency, they can afford to keep and store all of that forever, and later do analysis on it.
  [-]
  - bhouston 14 minutes ago
    > For any intelligence agency, they can afford to keep and store all of that forever, and later do analysis on it.
    At the scale the AI companies are operating at, I think it isn't likely that they are sucking it all in right now.
    More likely I think the intelligence agencies will get a real-time live tap into the raw data feed which they will process onsite for interesting things and then if things are flagged, they will log it in the intelligence agency systems.
- general1465 1 hour ago
  Leakage of IP and training on your data is something what I am pointing out too, but people will turn around and try to smooth me down that TOS does not allow that if you are an enterprise client. Are you really going to believe that AI companies won't ignore TOS, when they were ignoring literal laws which sent others to jail in the past? Especially when more data = better model?
eunos 3 hours ago
What Claude Code did is absolutely mindboggling tho, if Chinese harness did that probably POTUS would lose sleep.
[-]
- usef- 2 hours ago
  It seemed pretty mild compared to what's collected by modern websites and apps, though? How many don't know your Timezone?
  [-]
  - dijit 2 hours ago
    > How many don't know your Timezone?
    The timezone fetch was to alter program behaviour at runtime, not to send arbitrary timezones for tracking reasons.
    It was one way of detecting if it was a chinese person using the program and then behaving differently.
    Malware behaves this way. STUXNET for example was wired to do nothing except propagate unless the environment had the right conditions.
    [-]
    - usef- 1 hour ago
      The article on HN only said that they seemed to be collecting this to detect resellers. How else did the behavior change?
      Most services I know that are trying to block abuse do collect device info
      [-]
      - dijit 1 hour ago
        regardless of anything else, whether what you said is true or not: blocking program execution based on the detected environment is a runtime behaviour change.
        [-]
        usef- 1 hour ago
        Agreed. And it also applies to the "I'm not a bot" checkbox on most websites. And hundreds of other things people use every day.
        [-]
        stingraycharles 27 minutes ago
        Yeah I also believe it’s a big nothing burger. There are far worse things these AI labs have done, detecting when Chinese labs are using Claude Code is not it.
- cognitiveinline 3 hours ago
  Exaggerate much? If you think POTUS would lose sleep about a date format timezone marker, I don't know what to tell you.
- yard2010 3 hours ago
  Wait what do you mean "if"?
- ironbound 2 hours ago
  And I'm the king of France
- youre-wrong3 2 hours ago
  Maybe if they didn’t farm all the data from Claude to train their own trash models. Anthropic wouldn’t feel the need to do it.
  [-]
  - InsideOutSanta 2 hours ago
    Who is "they", and which Chinese models are trash?
  - vrganj 2 hours ago
    Anthropic stole the entire internet. Excuse my language, but they can fuck right off.
    [-]
    - breppp 2 hours ago
      The issue here is not whether Anthropic used Common Crawl, Alibaba also does that.
      The issue is that by distilling Claude, Alibaba reuses the IP anthropic used to train the model that's more akin to historical Chinese reverse engineering methods and disrespect of IP
      [-]
      - snovv_crash 1 hour ago
        Alibaba paid for that data though, right? They didn't hack Anthropic, they bought accounts and ran them normally.
        Also, you can't copyright AI outputs. So worst case they violated the ToS.
      - blackoil 1 hour ago
        'Issue' for who?
      - vrganj 2 hours ago
        Anthropic clearly doesn't respect other people's IP, it's real rich that they now insist on theirs being worthy of protection.
        Fwiw, I think the concept of IP in general is counter to human progress.
        [-]
        kataklasm 2 hours ago
        The practical implementation of IP? Sure, that's debatable. But the concept of IP is rooted in favoring progress. The thought process being, that if one's intellectual work can be copied and reused and modified and what not without issues, why should anyone invent things anymore? Just wait for the next person to do it and then copy their work, that's way less effort than inventing things yourself. IP aims to protect progress by making sure inventors have actual incentive to invent stuff. They way it's implemented is fundamentalst flawed, I agree, but the concept itself? I'm not so clear on that
        breppp 2 hours ago
        It's more complicated than that because Google has been legally displaying other people copyrighted material for years.
        In any case there's still a difference between publicly available copyrighted data and whether you can use it for model training, and the innovation around model training, RLHF, etc which you presumably have some interest as a country to allow companies to invest in with some legal protections (like the diff between patent law vs copyright law)
        [-]
        platinumrad 1 hour ago
        So you're saying it's more important to safeguard slop outputs than the original work of human beings.
        [-]
        breppp 41 minutes ago
        No, I am saying that there is a good chance that for the good of humanity, society decides that for miracle AGI we collectively forfeit copyright in LLM training yet IP protections for model development is still kept.
        There are many cases in the early 2000s were copyright protections were relaxed for tech advancements
      - matheusmoreira 2 hours ago
        > reuses the IP anthropic used to train the model
        > disrespect of IP
        Nobody other than Anthropic cares.
      - messe 2 hours ago
        > Alibaba reuses the IP anthropic used to train the model that's more akin to historical Chinese reverse engineering methods and disrespect of IP
        Why is this any worse than Anthropic's disrepect of IP? You've apparently drawn a distinction between the two here, but I'm failing to see what it actually is.
        [-]
        breppp 33 minutes ago
        Copyright law and IP law is not the same although everyone seem to conflate the two.
        Search engines for example historically ignored copyright law by copying excerpts or serving other site images, it doesn't mean someone copying Google's code has some moral frepass
        [-]
        messe 26 minutes ago
        > Copyright law and IP law is not the same although everyone seem to conflate the two.
        Copyright law is a subset of IP law. What IP is being infringed upon here?
        > Search engines for example historically ignored copyright law by copying excerpts or serving other site images
        Excerpts are often considered fair use, but it depends on country.
        > it doesn't mean someone copying Google's code has some moral frepass
        Nobody copied Anthropic's code. They used it's output to train another model. At most they violated some terms of service.
        Did they maybe abuse Anthropic's subsidised pricing? Sure. But that's what happens in a free market if you sell below cost.
        [-]
        breppp 19 minutes ago
        > Excerpts are often considered fair use, but it depends on country.
        That had happened progressively, thumbnails for example were ruled as fair use later on, DMCA safe harbor was a huge gift for tech companies because otherwise it would curtail the ability to create platforms (relaxing copyright protections in exchange of innovation)
        > Nobody copied Anthropic's code. They used it's output to train another model. At most they violated some terms of service
        Distilling a model is a method that can push the entire market to low margins and prevent companies from making money off such research. It also copies the Anthropic special parts (RLHF and other specific methods) rather than the "copy of the entire web" part
        This is similar to what happened with Chinese reverse engineering of American manufacturing or PC clones killing IBM PCs.
        Is it in the interest of the USA, probably no, that's why I assume this will be backed by law eventually
        [-]
        messe 10 minutes ago
        > Distilling a model is a method that can push the entire market to low margins and prevent companies from making money off such research
        Then it's on Anthropic to actually price their models accordingly so that distilling isn't profitable. Why does this need a legal remedy when market forces could easily resolve this?
        > Is it in the interest of the USA, probably no
        Good. The world needs to diversify away from dependence on US technology.
johnathan101 3 hours ago
Regardless of whether this specific claim is true, enterprises are becoming much more cautious about developer tools that can read large portions of proprietary codebases.
[-]
- soraminazuki 2 hours ago
  It's insane that it's becoming a concern now. It should've ended the discussion from the very beginning.
  [-]
  - yurish 2 hours ago
    Enterprises host their entire infrastructure on US-base clouds. And for many, it still is not a problem.
  - dan_i 1 hour ago
    [dead]
- pmontra 27 minutes ago
  After they uploaded their code to private repositories on GitHub, Bitbucket etc since forever?. They trust GitHub not to read their code but they don't trust an AI from Microsoft not to read it? It would be schizophrenia
- saidnooneever 2 hours ago
  not to mention they are kind of capable of executing code and susceptible to injections which also amounts to being practically backdoors if youre not super careful about how u use the tooling
- spwa4 3 hours ago
  Wasn't one of the big promises the AI labs made "uncopyrighting"? Ie. the ability to reconstruct large works, including source code, without actual access to the source code? Everything from movies to operating systems.
  [-]
  - xpct 4 minutes ago
    Interesting, I haven't heard this claim before. I suppose that claim made sense if their customers were big corporations, not so much when its the masses generating bootleg software copies.
  - silon42 2 hours ago
    Cleverly compressing and decompressing doesn't de-copyright it. ... and if it's not the same who'd trust it.
- llm_nerd 3 hours ago
  Becoming? We've moved entirely in the opposite direction.
  When these tools first appeared the overwhelming conversation was about the risk of letting a remote tool siphon your code and intellectual property (where eventually they're going to add that to their training). Now everyone is using them, and that fear seems to have dissolved. Every corporation is sprinkled with Claude Code, Antigravity, Copilot, Codex, and so on. Even the long fear-mongered Chinese providers are being heavily used in many spaces.
  In this case this is a PR battle between two firms, and it isn't much more. And Alibaba isn't worried about the "proprietary code" (the truth is that there is incredibly little interest in most orgs code), but that the tool is a backdoor, or at least that is the claim.
  [-]
  - DanielHB 2 hours ago
    > there is incredibly little interest in most orgs code
    I think from a commercial perspective yes, but access to source code is very good for finding exploits which could be very valuable for governments. I could also see a future where companies are directly cyber-attacking competitors in hostile markets too...
  - otabdeveloper4 2 hours ago
    > and that fear seems to have dissolved
    Until the first big incident, yes.
jdw64 2 hours ago
I got curious and asked my Chinese friends, and they gave me a Reddit link[1]. It looks like it's about location data collection, and they suggested that might be the reason for the issue.
[1]https://www.reddit.com/r/ClaudeAI/comments/1ujila1/anthropic...
[-]
- swingboy 1 hour ago
  There was a big thread about it here the other day. https://news.ycombinator.com/item?id=48734373
  [-]
ravenstine 2 hours ago
Employers in 2022:
> No! Don't install that lodash thing without explicit approval from IT. Oh, you want a license for Charles Proxy? Gee, I dunno... we've got a budget to maintain.
Employers in 2023:
> No! You can't use ChatGPT at work – it's a security risk.
Employers in 2024:
> Okay, you can use Github Copilot I guess, but you'll have to endure boring corporate training on what you're allowed to do with it.
Employers with dollar signs in their eyes in 2025:
> We attended a seminar about vibe coding. Why aren't you dumbasses keeping up with the times? Use Claude Code for everything! Don't write any of your own code anymore. We don't even really care if you use yolo mode. Just review code and push 10x more features! Use unlimited tokens! Money printer go brrrrr.
Employers in 2026:
> You mean giving one or two companies full autonomous access to our workstations while stupifying our engineers wasn't a sound business plan?
[-]
- dan_i 2 hours ago
  2025 taught me that my employer would replace me with a slave if they could get away with it.
  The confusing part to me is why these companies believed the "AGI" hype, I.E. that OpenAI or Claude's LLM is the ideal white collar slave.
  I suppose I can understand that the executive class resents labor enough to make irrational business decisions for the purpose of insulting the workers who design and operate their companies.
  That being said, the 2025 AI binge feels like a murder-suicide done by the executives of many of these companies.
khurs 1 hour ago
Snowden files revealed NSA collect everything they can.
Of-course USA is collecting everything, not just from China but everyone.
And same with every one else.
bushido 1 hour ago
What's very interesting to me is these moves will introduce a good amount of doubt in future claims by Claude etc, that the open source and non-US models are only getting better because they're distilling from frontier labs.
Simulacra 46 minutes ago
https://archive.is/bmyny
feverzsj 3 hours ago
Considering their massive distillation, if US companies stop publishing new models to the public, would China still be able to develop new open weight models?
[-]
- bel8 3 hours ago
  I don't think China would strugle to scrape the internet for fresh data.
  And they constantly publish state of the art LLM research (see DS4 context compaction and cache tech).
  They have very capable tech giants. So while not being able to distill western models would probably have some impact, it's probably becoming lesser as time passes.
  We might even see Western LLMs distilling Chinese models soon. If they aren't already to some extent.
- VortexLain 33 minutes ago
  Depends on a lab, but they do have plenty of compute and engineering. So this would only slow down the progress.
- tristanj 3 hours ago
  Yes, 100%. GLM 5.2 is capable of RSI. It's too late to stop.
- pjmlp 2 hours ago
  Of course, it is like any other kind of weapon system, eventually the knowledge gets acquired.
- surgical_fire 2 hours ago
  Probably yes.
  More than a year ago, when Anthropic and OpenAI started to hide the reasoning bits from the output, a lot of people here on HN predicted that Chinese models days were numbered.
  Fast forward to today, and models such as DeepSeek and MiMo are nothing short of excellent. I haven't used GLM or Qwen but heard very good things about them as well.
  This "massive distillation" sounds a lot like anxiety about how companies from outside the US can develop very good models themselves.
  [-]
  - VortexLain 32 minutes ago
    In my personal, subjective opinion GLM-5.2 is on par with GPT-5.3
- margorczynski 3 hours ago
  China has most probably already achieved "escape velocity" on the software side. Now if they achieve parity, to some degree at least, on the hardware side with Nvidia it is very possible they'll overtake the US.
yanhangyhy 4 hours ago
i gonna ask: how can they still use claude? i thought all users in china are banned
[-]
- dgellow 3 hours ago
  Alibaba has engineers in Hongkong, Singapore, North America. It’s a global corporation
  [-]
  - itake 3 hours ago
    when i was in hongkong, chatgpt and gemini were disabled. Maybe this has changed though. When I was in China, the corporate vpn (zscaler) routed traffic through hk
- xyzsparetimexyz 3 hours ago
  https://www.chinatalk.media/p/how-to-buy-cheap-claude-tokens...
- bravetraveler 4 hours ago
  Same way every ban is evaded, smurfing
- playnuu9 4 hours ago
  There is a reason Singapore tops the rank on Claude usage
  [-]
  - byzantinegene 3 hours ago
    the government also actively promotes AI usage in work environments
  - chinathrow 1 hour ago
    Source?
- _flux 4 hours ago
  Does Alibaba only have developers in the China?
  [-]
  - one33seven 3 hours ago
    Did china invent VPNs yet?
- josh-wrale 4 hours ago
  Cc can be used with non Anthropic models.
- dist-epoch 3 hours ago
  The same way they buy "banned" and "sanctioned" NVIDIA GPUs.
- TZubiri 1 hour ago
  one possibility:
  iproyal.com Oxylabs.io
  https://krebsonsecurity.com/2025/10/aisuru-botnet-shifts-fro...
- re-thc 4 hours ago
  > how can they still use claude?
  Workarounds aside, it says Claude Code not Claude.
  i.e. they are using the CLI running any model. You can for instance run GLM with it.
rvnx 4 hours ago
Can't say they are wrong, after the latest backdoor, or let's say, undocumented functionality that leaks some data that was pushed in Claude Code few days ago
https://news.ycombinator.com/item?id=48759754
[-]
- dgellow 3 hours ago
  That’s not what a backdoor is…
  [-]
  - tpoacher 3 hours ago
    Rear entrance then
  - rvnx 3 hours ago
    When a company can remotely push code without explicit user approval, and code that was hostile / almost malicious, it is a backdoor
    [-]
    - jitl 2 hours ago
      so like… any website
rvz 3 hours ago
Another reason to use open source coding agents and local language models.
Claude Code is neither and it is literally info stealing malware.
aivisibility96 36 minutes ago
[flagged]
HlessClaudesman 3 hours ago
Translation: Alibaba will continue distillation attacks using accounts that aren't directly attributable to it's own corporate infrastructure.
[-]
- ampersandwhich 2 hours ago
  I think we should start calling it "distillation terrorism" just to make it sound even more absurd.
  [-]
  - InsideOutSanta 2 hours ago
    It's pure model murder, and if you call it anything else, you're an anti-American communist.
- lelanthran 2 hours ago
  > Translation: Alibaba will continue distillation attacks using accounts that aren't directly attributable to it's own corporate infrastructure.
  What's a "distillation attack"? How is it different from simply distillation?
  [-]
  - kouteiheika 2 hours ago
    It's pretty much the same as when "installing programs on your computer" is called "sideloading". Deliberately deceptive, weaponized language to make it seem like a bad thing.
  - TZubiri 41 minutes ago
    using infected machines as proxies would be a fair line in the sand
  - dizhn 2 hours ago
    The target doesn't want to be distilled.
    [-]
    - julianlam 1 hour ago
      You wouldn't distill a car.
      [-]
      - HlessClaudesman 27 minutes ago
        I would distill all the cars.
    - lelanthran 1 hour ago
      > The target doesn't want to be distilled.
      So?
      Fraudsters don't want to be jailed, their victims don't want to be scammed, employees don't want to be laid off, etc.
      What the target wants is irrelevant - what society wants as enforced by laws is what is relevant, and as the leading AI providers have demonstrated, simply grabbing other people's copyrighted stuff for learning purposes is perfectly fine!
      If they already think this practice is fine, why would I believe that their concerns about this are real?
- RobotToaster 3 hours ago
  (Mis)anthropic already performed "distillation attacks" on the internet.
- vorticalbox 3 hours ago
  i can see why they want to stop it but 1. you have to pay for the "attack" 2. these AI companies trained on copyrighted content without permission or attribution to anyone who's data was used to train.
- exe34 2 hours ago
  As long as they're paying for the tokens, there's no attack . Otherwise you have to call training on copyrighted material theft.
  [-]
  - feverzsj 2 hours ago
    They are not paying for most tokens. The actual users in China do. All they need is the logs.
    [-]
    - InsideOutSanta 2 hours ago
      Anthropic still gets paid.
      Unlike the vast majority of people Anthropic stole from.
    - dizhn 2 hours ago
      In that case it's already bought and paid for by the users, is it not?
- vrganj 2 hours ago
  Did Anthropic perform "distillation attacks" when they hoovered up the entire internet?
- surgical_fire 2 hours ago
  How exactly the word attack fits in that phrase?
mbmbn 1 hour ago
Are they afraid Claude reports on everything they are stealing from the other legit AI companies?
Jeff9James 1 hour ago
Story of Z.ai:
use claude-code see how good it is send 100k bots to distill fable 5 (GLM 5.2 is the result of this) release Zcode ditch claude-code ban claude-code
[-]
- codedokode 47 minutes ago
  The outcome is that we get either free or cheaper model. Good work.
- julianlam 1 hour ago
  [citation needed]