InfoQ Homepage Machine Learning Content on InfoQ
-
How Meta Is Using AI to Standardize and Cut Carbon Emissions
Meta has developed an AI-based approach to improve the quality of Scope 3 emissions estimates across its IT hardware supply chain. The method combines machine learning and generative models to classify hardware components and infer missing product carbon footprint (PCF) data.
on Oct 31, 2025 -
Google Research Open-Sources the Coral NPU Platform to Help Build AI into Wearables and Edge Devices
Coral NPU is an open-source full-stack platform designed to help hardware engineers and AI developers overcome the limitations that prevent integrating AI in wearables and edge devices, including performance, fragmentation, and user trust.
on Oct 22, 2025 -
Instagram Improves Engagement by Reducing Notification Fatigue with New Ranking Framework
Meta has introduced a diversity-aware ranking framework for Instagram notifications. The system applies multiplicative penalties to reduce repetitive alerts from the same creators or product surfaces, improving engagement while maintaining relevance and introducing content variety.
on Sep 29, 2025 -
An AI-Driven Approach to Creating Effective Learning Experiences at QCon
An experiment was created around a certification program influenced by AI at QCon London, which included special events during the conference, a pre-conference breakfast where participants could learn about upcoming activities, and an AI-driven workshop immediately following the conference. Wes Reisz spoke at InfoQ Dev Summit Boston about a program he led using AI.
on Aug 28, 2025 -
How Netflix is Reimagining Data Engineering for Video, Audio, and Text
Netflix has introduced a new engineering specialization—Media ML Data Engineering, alongside a Media Data Lake designed to handle video, audio, text, and image assets at scale. Early results include richer ML models trained on standardized media, faster evaluation cycles, and deeper insights into creative workflows.
on Aug 25, 2025 -
Roblox Open-Sources AI System to Detect Conversations Potentially Harmful to Kids
Roblox Sentinel is an AI system designed to detect early signs of potential child endangerment for further analysis and investigation. Implemented as a Python library, Sentinel uses contrastive learning to handle highly imbalanced datasets that often challenge traditional classifiers and can be applied to a wide range of use cases.
on Aug 16, 2025 -
Google Releases Major Firebase Studio Updates for Agentic AI Development
At Google Cloud Summit London in early July, Google revealed new capabilities in Firebase Studio that promise to enhance agentic cloud-based development: an autonomous Agent mode, native support for Model Context Protocol (MCP), and Gemini CLI integration. These updates aim to streamline agentic AI development by making AI agents more independent and seamlessly embedded in developer workflows.
on Jul 31, 2025 -
Databricks Agent Bricks Automates Enterprise AI Development with TAO and ALHF Methods
Databricks introduced Agent Bricks, a new product that changes how enterprises develop domain-specific agents. The automated workflow includes generating task-specific evaluations and LLM judges for quality assessment, creating synthetic data that resembles customer data to supplement agent learning, and searching across optimization techniques to refine agent performance.
on Jul 28, 2025 -
Microsoft Adds Deep Research Capability in Azure AI Foundry Agent Service
Unlock the future of research with Microsoft’s Azure AI Foundry Agent Service, featuring Deep Research—an innovative tool that empowers knowledge workers in complex fields. This advanced AI capability autonomously analyzes and synthesizes web data, automating rigorous research tasks while ensuring traceability and transparency. Sign up for the public preview today!
on Jul 14, 2025 -
Arm Scalable Matrix Extension 2 Coming to Android to Accelerate On-Device AI
Available in the Armv9-A architecture, Arm Scalable Matrix Extension 2 (SME2) is a set of advanced CPU instructions designed to accelerate matrix heavy computation. The new Arm technology aims to help mobile developers to run advanced AI models directly on CPU with improved performance and efficiency, without requiring any changes to their apps.
on Jul 13, 2025 -
The Rise of Energy and Water Consumption Using AI Models, and How It Can Be Reduced
Artificial intelligence's (AI) energy and water consumption has become a growing concern in the tech industry, particularly for large-scale machine learning models and data centers. Sustainable AI focuses on making AI technology more environmentally friendly and socially responsible.
on Jun 26, 2025 -
QCon AI New York 2025: Program Committee Announced
Meet the QCon AI New York Program Committee, senior software leaders shaping a practical AI conference for engineers building at scale.
on Jun 11, 2025 -
Google Cloud Run Now Offers Serverless GPUs for AI and Batch Processing
Google Cloud has launched NVIDIA GPU support for Cloud Run, enhancing its serverless platform with scalable, cost-efficient GPU resources. This upgrade enables rapid AI inference and batch processing, featuring pay-per-second billing and automatic scaling to zero. Developers can access seamless GPU support easily, making advanced AI applications faster and more accessible.
on Jun 09, 2025 -
Virt8ra Sovereign Cloud Expands with Six New European Providers
Virt8ra is a groundbreaking European initiative aiming to establish a sovereign, interoperable cloud ecosystem, countering US cloud dominance. With significant expansion, now inclusive of six new providers, and a focus on open-source technology, Virt8ra promotes data localization and vendor independence, paving the way for an innovative digital future across Europe.
on Jun 04, 2025 -
Azure AI Search Unveils Agentic Retrieval for Smarter Conversational AI
Microsoft’s Azure AI Search unveils agentic retrieval, a cutting-edge query engine that enhances conversational AI answer relevance by up to 40%. This dynamic system leverages conversation history and parallel subquery execution, paving the way for sophisticated knowledge retrieval. Currently in public preview, it offers adaptive search strategies tailored for evolving enterprise needs.
on May 31, 2025