InfoQ Software Architects' Newsletter

A monthly overview of things you need to know as an architect or aspiring architect.

InfoQ Homepage AI, ML & Data Engineering Content on InfoQ

News

RSS Feed

Newer Older

AI, ML & Data Engineering

Hugging Face Introduces RTEB, a New Benchmark for Evaluating Retrieval Models

Hugging Face unveils the Retrieval Embedding Benchmark (RTEB), a pioneering framework to assess embedding models' real-world retrieval accuracy. By merging public and private datasets, RTEB narrows the "generalization gap," ensuring models perform reliably across critical sectors. Now live and inviting collaboration, RTEB aims to set a community standard in AI retrieval evaluation.

Robert Krzaczyński
on Oct 16, 2025
AI, ML & Data Engineering

10 AI-Related Standout Sessions at QCon San Francisco 2025

Join us at QCon San Francisco 2025 (Nov 17–21) for a three-day deep dive into the future of software development, exploring AI’s transformative impact. As a program committee member, I’m excited to showcase tracks that tackle real-world challenges, featuring industry leaders and sessions on AI, LLMs, and engineering mindsets. Don’t miss out!

Hien Luu
on Oct 14, 2025
AI, ML & Data Engineering

Paper2Agent Converts Scientific Papers into Interactive AI Agents

Stanford's Paper2Agent framework revolutionizes research by transforming static papers into interactive AI agents that execute analyses and respond to queries. Leveraging the Model Context Protocol, it simplifies reproducibility and enhances accessibility, empowering users with dynamic, autonomous tools for deeper scientific exploration and understanding.

Robert Krzaczyński
on Oct 14, 2025
AI, ML & Data Engineering

Genkit Extension for Gemini CLI Brings Framework-Aware AI Assistance to the Terminal

Introducing Google's Genkit Extension for Gemini CLI: a groundbreaking tool that delivers framework-aware AI assistance directly to the terminal. Streamline your Genkit application development with context-aware code generation, debugging, and best practices—all without leaving the command line. Unleash productivity and innovation in building generative AI applications.

Hien Luu
on Oct 13, 2025
AI, ML & Data Engineering

GitHub MCP Registry Offers a Central Hub for Discovering and Deploying MCP Servers

GitHub has recently launched its Model Context Protocol (MCP) Registry, designed to help developers discover and use the AI tools directly from within their working environment. The registry currently lists over 40 MCP servers from Microsoft, GitHub, Dynatrace, Terraform, and many others.

Sergio De Simone
on Oct 13, 2025
AI, ML & Data Engineering

OpenAI Adds Full MCP Support to ChatGPT Developer Mode

OpenAI has rolled out full Model Context Protocol (MCP) support in ChatGPT, bringing developers a long-requested feature: the ability to use custom connectors for both read and write actions directly inside chats. The feature, now in beta under Developer Mode, effectively turns ChatGPT into a programmable automation hub capable of interacting with external systems or internal APIs.

Robert Krzaczyński
on Oct 13, 2025
AI, ML & Data Engineering

OpenAI Study Investigates the Causes of LLM Hallucinations and Potential Solutions

In a recent research paper, OpenAI suggested that the tendency of LLMs to hallucinate stems from the way standard training and evaluation methods reward guessing over acknowledging uncertainty. According to the study, this insight could pave the way for new techniques to reduce hallucinations and build more trustworthy AI systems, but not all agree on what hallucinations are in the first place.

Sergio De Simone
on Oct 12, 2025
AI, ML & Data Engineering

Claude Sonnet 4.5 Tops SWE-Bench Verified, Extends Coding Focus beyond 30 Hours

Anthropic's Claude Sonnet 4.5, its most advanced coding model, excels in task performance and safety, achieving a 98.7% safety score and improving real-world coding capabilities. Enhanced reasoning skills allow for sustained multi-step tasks, with notable user gains reported. This drop-in replacement demonstrates a powerful balance of capability and security for users.

Hien Luu
on Oct 11, 2025
AI, ML & Data Engineering

PlanetScale Extends Database Platform to PostgreSQL

PlanetScale has announced the general availability of its managed sharded Postgres service, built for performance and reliability on AWS or Google Cloud. The launch extends PlanetScale's offerings to PostgreSQL users, adding to the company's existing popular MySQL-based platform built on top of Vitess.

Renato Losio
on Oct 11, 2025
AI, ML & Data Engineering

Google DeepMind Introduces CodeMender, an AI Agent for Automated Code Repair

Google DeepMind has introduced CodeMender, a new AI-driven agent designed to detect, fix, and secure software vulnerabilities automatically. The project builds on recent advances in reasoning models and program analysis, aiming to reduce the time developers spend identifying and patching security issues.

Robert Krzaczyński
on Oct 11, 2025
AI, ML & Data Engineering

OpenAI DevDay 2025 Introduces GPT-5 Pro API, Agent Kit, and More

At OpenAI's DevDay 2025, AgentKit and models GPT-5 Pro and Sora 2 were unveiled, enabling interactive software experiences directly within ChatGPT. This shift towards "apps inside ChatGPT" fosters collaboration and commercialization in conversations. Enhanced self-hosting options and robust SDKs empower developers and streamline workflows, positioning OpenAI at the forefront of AI innovation.

Andrew Hoblitzell
on Oct 10, 2025
AI, ML & Data Engineering

QCon AI New York 2025 Schedule Published, Highlights Practical Enterprise AI

The QCon AI New York 2025 schedule is now live for its Dec 16-17 event. Focused on moving AI from PoC to production, the program offers a practical roadmap for senior engineers & tech leaders. It addresses the real-world challenges of building, scaling, and deploying reliable, enterprise-grade AI systems, helping organizations overcome the hurdles of productionizing their AI initiatives.

Artenisa Chatziou
on Oct 10, 2025
AI, ML & Data Engineering

GitHub Introduces New Embedding Model to Improve Code Search and Context

GitHub has introduced a new embedding model for Copilot, now integrated into Visual Studio Code. The model is designed to improve how Copilot understands programming context, retrieves relevant code, and suggests completions.

Daniel Dominguez
on Oct 10, 2025
AI, ML & Data Engineering

The New Data Commons MCP Server Unlocks a Wealth of Public Datasets for AI Developers

Google has recently introduced the Data Commons Model Context Protocol (MCP) Server, a tool that enables AI developers and researchers to easily access the public dataset collection available through Data Commons.

Sergio De Simone
on Oct 09, 2025
AI, ML & Data Engineering

Google DeepMind Launches Gemini 2.5 Computer Use Model to Power UI-Controlling AI Agents

Google DeepMind has recently released the Gemini 2.5 Computer Use model, a specialized variant of its Gemini 2.5 Pro system designed to enable AI agents to interact directly with graphical user interfaces. The new model allows developers to build agents that can click, type, scroll, and manipulate interactive elements on web pages.

Robert Krzaczyński
on Oct 09, 2025

Newer News

Older News