Windsurf Rides the AI Wave
Plus: Check these AI Tools
Good morning, AI enthusiasts! Ready to dive into the wild world of artificial intelligence? From groundbreaking launches to global expansions, it's a whirlwind out there, and we're here to make sense of it all. Let's get started.
Windsurf's In-House AI Takes the Helm
Making waves in the coding seas.
AI coding platform Windsurf just launched SWE-1, its first family of in-house AI models specifically designed to assist with the entire software engineering lifecycle, not just code generation.
The Breakdown:
The SWE-1 family features three models: SWE-1 (full-size, for paid users), SWE-1-lite (replacing Cascade Base for all users), and SWE-1-mini.
According to Windsurf's internal benchmarks, SWE-1 outperforms all non-frontier and open-weight models and lands just behind Claude 3.7 Sonnet.
Unlike traditional models focused on code generation, SWE-1 is built to handle multiple environments: editors, terminals, and browsers.
Its "flow awareness" system creates a shared timeline between users and AI, enabling seamless handoffs during development.
Why it matters: Windsurf is moving beyond being just an app layer for third-party models. This bold move comes days after a rumored $3B acquisition by OpenAI. Clearly, there's more behind that deal than meets the eye.
Poe Usage Charts: AI Popularity Shifts
Model wars are heating up.
AI platform Poe just released its Spring 2025 Model Usage Trends report, shedding light on major shifts in AI preferences across text, reasoning, image, and video generation.
Key Takeaways:
GPT-4.1 and Gemini 2.5 Pro captured 10% and 5% of message share within weeks of launch. Meanwhile, Claude saw a 10% drop in the same window.
Reasoning models surged from just 2% to 10% of all text messages since January, with Gemini 2.5 Pro dominating a third of the subcategory.
Image generation saw GPT-image-1 gain 17% usage, directly challenging Black Forest Labs' FLUX and Google's Imagen3.
In video, China's Kling family took over with ~30% usage right after release, while ElevenLabs still holds 80% of audio.
Why it matters: Poe's report is a real-world snapshot of user preferences, highlighting how quickly new models can shake up the leaderboard. At this rate, next quarter's list might look completely different.
LLMs Struggle with Back-and-Forth Chats
Turns out, patience isn't a strong suit.
A new study from Microsoft and Salesforce researchers revealed that LLMs seriously underperform during multi-turn conversations where instructions are gradually revealed, often getting "lost" and failing to recover.
Study Highlights:
15 leading LLMs, including Claude 3.7 Sonnet, GPT-4.1, and Gemini 2.5 Pro, were tested across six generation tasks.
Models hit 90% success in single-turn scenarios, but that plummeted to 60% during multi-turn exchanges.
The main issue? LLMs tend to jump to conclusions, building on incorrect assumptions without recalibration.
Why it matters: This exposes a blind spot in LLM capabilities, proving that real-world, multi-turn dialogues are still a massive challenge, something developers need to factor into design.
World's First AI-Doctor Clinic Opens in Saudi Arabia
Virtual medicine is here.
Chinese tech firm Synyi AI has launched the world's first AI-guided medical center in Saudi Arabia, marking its debut in the international market.
The Scoop:
The clinic features a virtual doctor, Dr. Hua, who handles initial diagnoses and drafts treatment plans for review by a human physician.
Currently focused on 30 respiratory conditions, with plans to expand to 50 by year's end.
Why it matters: Synyi AI is setting the stage for global AI-driven healthcare. This Saudi launch could pave the way for a new era of automated medical services.
Other News
You.com's ARI outperforms OpenAI's Deep Research with a 76% win rate, adding enterprise features.
Meta delays Llama Behemoth to fall, citing performance issues.
OpenAI launches OpenAI to Z Challenge with a $250k prize for discovering archaeological sites.
Salesforce acquires Convergence AI, integrating it into Agentforce.
Intelligent Internet debuts II-Medical-9B, a small, local-run medical model comparable to GPT-4.5.
Manus AI introduces image generation for step-by-step visual planning.
NVIDIA locks in chip deals with Saudi Arabia's Humain and the UAE, following strategic meetings.
Mastery tools of the day
TikTok AI Alive - Turn static images into dynamic videos for TikTok Stories
CodeRabbit - AI code reviews directly in Cursor, Windsurf, and VSCode
LegoGPT - Create stable, buildable LEGO designs from text prompts
Emergent - World's first agentic vibe coding platform, taking you from idea to fully functional application, ready for real users
What else are we reading and seeing?
Working on Complex Systems
Stack Overflow is almost dead
LLMs are Making Me Dumber
Microsoft pulls plug on Bing Search APIs
YouTube launches weekly top podcast list to rival Spotify and Apple
Microsoft's CEO on How AI Will Remake Every Company, Including His
If AI is so good at coding... where are the open source contributions?
Robot chefs take over at South Korea's highway restaurants, to mixed reviews
That's it for today, folks! As always, stay curious, stay informed, and keep pushing the boundaries of what AI can do. See you tomorrow!

