ThirdBrAIn.tech

ThirdBrAIn.tech

Search

❯

❯

❯

❯

❯

Anthropic Claude 3 Released! Did It pass the Coding Test?

Apr 02, 20252 min read

Anthropic Claude 3 Released! Did It pass the Coding Test?

AI Summary

Introduction to Anthropics’ Cloe 3 update

Features model overview, coding test, logical reasoning test, and safety test

Cloe 3 Opus outperforms GP4 in benchmarks

Three versions of Cloe 3 released

Opus: Most expensive, highest intelligence score, instant results for live chats, auto-completions, and data extraction

HighQ: Cheapest, fastest, reads data-dense papers in under 3 seconds

Sonet: Twice as fast as Clae 2/2.1, multimodal (handles images and text), higher intelligence

Comparison with GP4

Cloe 3 Opus has improved accuracy and a larger context window of 200,000 tokens (1 million for specific cases)

Cost: $15 p er in p u tt o k e n,$ 75 per output token

Opus and Sonet available via API, HighQ coming soon

Higher recall accuracy in Cloe 3 Opus

Testing Cloe 3

Python programming challenges from very easy to expert level

Cloe 3 successfully passes most tests, with some exceptions due to potential Python version issues

Logical reasoning and safety tests

Cloe 3 performs well in logical reasoning scenarios

Demonstrates safe responses by refusing to provide instructions for illegal activities

Conclusion

Cloe 3 is a robust model with multimodal capabilities and function calling

Encouragement to subscribe, like, and share the video for further updates on function calling features

Graph View

Backlinks

No backlinks found

Created with Quartz v4.2.3 © 2025

GitHub
Discord Community