V4-Pro Achieves 67% Coding Pass Rate in Internal Dogfooding Test, Approaching Opus 4.5 Performance

Gate News message, April 24 — V4 has publicly disclosed internal dogfooding data for its V4-Pro model. The company collected approximately 200 real-world engineering tasks from over 50 engineers, covering feature development, bug fixes, refactoring, and diagnostics across tech stacks including PyTorch, CUDA, Rust, and C++. After rigorous filtering, 30 tasks were retained for the benchmark evaluation.

V4-Pro-Max achieved a 67% coding pass rate, significantly outperforming Sonnet 4.5 at 47% and approaching Opus 4.5 at 70%. However, it trails Opus 4.5 Thinking (73%) and Opus 4.6 Thinking (80%), while substantially exceeding Haiku 4.5 at 13%.

In an internal survey with 85 respondents, all participants reported using V4-Pro for agentic coding in daily workflows. 52% endorsed V4-Pro as their default primary coding model, 39% leaned toward approval, and less than 9% expressed disapproval. Reported issues included low-level errors, misinterpretation of ambiguous prompts, and occasional over-thinking behavior.

Disclaimer: The information on this page may come from third parties and does not represent the views or opinions of Gate. The content displayed on this page is for reference only and does not constitute any financial, investment, or legal advice. Gate does not guarantee the accuracy or completeness of the information and shall not be liable for any losses arising from the use of this information. Virtual asset investments carry high risks and are subject to significant price volatility. You may lose all of your invested principal. Please fully understand the relevant risks and make prudent decisions based on your own financial situation and risk tolerance. For details, please refer to Disclaimer.

Related Articles

OpenClaw Releases v2026.4.29 on April 29, Upgrades Memory to Personalized Wiki with Relationship Tracking

According to Beating, open-source AI assistant OpenClaw (GitHub 367K stars) released v2026.4.29 on April 29, marking its second update in two days. The memory system evolved from simple retrieval-based recall to personalized wiki, enabling agents to automatically build character profiles and track r

GateNews45m ago

Musk Testifies xAI Used OpenAI Models to Train Grok

Elon Musk testified Thursday in California federal court that his artificial intelligence company xAI partly used OpenAI models while training its Grok chatbot, according to TechCrunch. The admission represents a rare public acknowledgment by a major AI developer of a practice under growing

CryptoFrontier3h ago

Google CEO Pichai reveals that using Gemini AI to understand human nature helps build more sincere communication

Pichai said that before important meetings, he uses Gemini’s perspective to analyze and predict the other party’s psychology, thereby improving empathy and enabling more sincere communication. AI agents can also automatically organize emails, scheduling, and summaries, making everyday chores more efficient. Meanwhile, AI platforms centered on open co-creation are emerging; open-source technologies such as Gemini 4 lower the barrier to entry. At the same time, it emphasizes building AI governance frameworks, with governments and society needing to participate to address challenges such as cybersecurity, deepfakes, and sustainability.

ChainNewsAbmedia3h ago

OpenAI Launches Advanced Account Security for ChatGPT

Advanced Account Security Launch OpenAI on Thursday introduced Advanced Account Security, a new opt-in setting for ChatGPT designed for users who want stronger protection or face higher risks of digital attacks. The company said the new feature was created in response to how people are

CryptoFrontier4h ago

X (Twitter) ushers in its biggest ad platform upgrade in 20 years, with xAI involved; AI semantic targeting becomes the core

X announced that it will roll out the largest advertising platform overhaul in 20 years starting in April 2026, rebuilding the underlying technology and combining it with xAI. The new platform will focus on AI-driven performance optimization and semantic and contextual advertising to improve operational convenience and ad placement control. Its goal is to turn advertising into real-time commercial signals in context, and to serve as X’s business engine within the X ecosystem in line with its Everything App strategy.

ChainNewsAbmedia7h ago

OpenAI-Backed 1X Opens 58,000-Sq-Ft Factory in California, Targets 10,000 Robots in First Year

According to Bloomberg, 1X Technologies, an OpenAI-backed robotics startup founded in Norway, has opened a 58,000-square-foot manufacturing facility in Hayward, California, aiming to lead in mass-producing consumer-grade humanoid robots. The facility is expected to produce 10,000 robots in its

GateNews10h ago
Comment
0/400
No comments