• Anthropic accuses DeepSeek, other Chinese AI developers of ‘industrial-scale’ copying - Claims ‘distillation’ included 24,000 fraudulent accounts and 16 million exchanges to train smaller models 16 million exchanges to train various models. • Get Tom’s Hardware’s best news and in-depth reviews, straight to your inbox. • You are now subscribed Your newsletter sign-up was successful Anthropic on Monday accused three leading Chinese developers of frontier AI models of using large-scale distillation to improve their own models by using Anthropic’s Claude capabilities. • In total, DeepSeek, Moonshot, and MiniMax made 16 million exchanges using 24,000 fraudulent accounts. • Distillation is a machine learning technique in which a smaller or less capable model is trained on the outputs of a stronger model instead of using actual data to train. • It can save time, create cheaper, more specialized models, extract capabilities from competitors, and/or lower requirements for hardware capabilities.
Article Summaries:
- Anthropic announced on Monday that it has identified three Chinese AI developers-DeepSeek, Moonshot, and MiniMax-as having engaged in large‑scale “distillation” of its Claude model. The company claims the firms used 24,000 fraudulent accounts to make 16 million API exchanges, training smaller models on Claude’s outputs. Anthropic says the activity violates U.S. export controls and its end‑user license agreement, and could remove safety safeguards. While distillation is a legitimate technique for creating cheaper, specialized models, Anthropic argues that the Chinese firms’ use was illicit and aimed at extracting capabilities for military or surveillance applications.
Sources: