InfoQ Homepage Machine Learning Content on InfoQ
-
Posted by
Iaroslav Amerkhanov
on
Aug 27, 2025
AI for Food Image Generation in Production: How & Why
Iaroslav Amerkhanov discusses how his team at Delivery Hero leveraged GenAI to generate food images, detailing the architecture, optimization, and business impact.
on Aug 27, 2025Icon44:46 -
Posted by
Victor Dibia
on
Aug 14, 2025
10 Reasons Your Multi-Agent Workflows Fail and What You Can Do about It
Victor Dibia discusses multi-agent systems, detailing how to build them with AutoGen, common failure points, and strategic approaches for senior software developers and engineering leaders.
on Aug 14, 2025Icon48:24 -
Posted by
Bibek Bhattarai
on
Jul 31, 2025
Maximizing Deep Learning Performance on CPUs using Modern Architectures
Bibek Bhattarai demystifies Intel AMX, explaining how this CPU architecture accelerates deep learning workloads via low-precision matrix multiplication and efficient data handling.
on Jul 31, 2025Icon39:25 -
Posted by
Denys Linkov
on
Jul 01, 2025
A Framework for Building Micro Metrics for LLM System Evaluation
Denys Linkov discusses critical lessons for senior developers and leaders on building robust LLM systems and actionable metrics that prevent production issues and drive business value.
on Jul 01, 2025Icon29:10 -
Posted by
David Berg
on
Jun 17, 2025
Supporting Diverse ML Systems at Netflix
David Berg and Romain Cledat discuss Metaflow, Netflix's ML infrastructure for diverse use cases from computer vision to recommendations.
on Jun 17, 2025Icon49:00 -
Posted by
Sebastiano Galazzo
on
Apr 23, 2025
From "Simple" Fine-Tuning to Your Own Mixture of Expert Models Using Open-Source Models
Sebastiano Galazzo shares practical tips and mistakes in creating custom LLMs for cost-effective AI. Learn LoRA, merging, MoE & optimization.
on Apr 23, 2025Icon48:19 -
Posted by
Leo Browning
on
Apr 22, 2025
How Green is Green: LLMs to Understand Climate Disclosure at Scale
Leo Browning explains the journey of developing a Retrieval Augmented Generation (RAG) system at a climate-focused startup.
on Apr 22, 2025Icon47:29 -
Posted by
Stefania Chaplin
on
Apr 17, 2025
LLM and Generative AI for Sensitive Data - Navigating Security, Responsibility, and Pitfalls in Highly Regulated Industries
Stefania Chaplin and Azhir Mahmood discuss responsible, secure, and explainable AI in regulated industries. Learn MLOps, legislation, and future trends.
on Apr 17, 2025Icon43:50 -
Posted by
Anil Rajput
on
Apr 07, 2025
Unleashing Llama's Potential: CPU-Based Fine-Tuning
Anil Rajput and Rema Hariharan detail CPU-based LLM (Llama) optimization strategies for performance and TCO reduction.
on Apr 07, 2025Icon48:11 -
Posted by
Meryem Arik
on
Mar 28, 2025
Navigating LLM Deployment: Tips, Tricks, and Techniques
Meryem Arik shares best practices for self-hosting LLMs in corporate environments, highlighting the importance of cost efficiency and performance optimization.
on Mar 28, 2025Icon39:49 -
Posted by
Ivan Burmistrov
on
Mar 20, 2025
The Harsh Reality of Building a Real-Time ML Feature Platform
Ivan Burmistrov shares how ShareChat built their own Real-Time Feature Platform serving more than 1 billion features per second, and how they managed to make it cost efficient.
on Mar 20, 2025Icon47:16 -
Posted by
Moumita Bhattacharya
on
Mar 17, 2025
Recommender and Search Ranking Systems in Large Scale Real World Applications
Moumita Bhattacharya overviews the industry search and recommendations systems, goes into modeling choices, data requirements and infrastructural requirements, while highlighting challenges.
on Mar 17, 2025Icon48:46