InfoQ Homepage AI, ML & Data Engineering Content on InfoQ
-
Posted by
Lexy Kassan
on
Apr 10, 2025
Responsible AI for FinTech
Lexy Kassan discusses responsible AI: regulation (EU AI Act, FinTech), ethical principles, governance, and FinTech's disruptive response.
on Apr 10, 2025Icon40:25 -
Posted by
Anil Rajput
on
Apr 07, 2025
Unleashing Llama's Potential: CPU-Based Fine-Tuning
Anil Rajput and Rema Hariharan detail CPU-based LLM (Llama) optimization strategies for performance and TCO reduction.
on Apr 07, 2025Icon48:11 -
Posted by
Igor Canadi
on
Apr 03, 2025
Rockset - Building a Modern Analytics Database on Top of RocksDB
Igor Canadi discusses building a real-time search analytics database on RocksDB, covering cloud-native design, replication, shared storage, and analytics.
on Apr 03, 2025Icon47:45 -
Posted by
Meryem Arik
on
Mar 28, 2025
Navigating LLM Deployment: Tips, Tricks, and Techniques
Meryem Arik shares best practices for self-hosting LLMs in corporate environments, highlighting the importance of cost efficiency and performance optimization.
on Mar 28, 2025Icon39:49 -
Posted by
Nischal HP
on
Mar 26, 2025
AI in the Age of Climate Change
Nischal HP shares insights on building a data-driven economy to incentivize sustainable farming and reduce carbon emissions.
on Mar 26, 2025Icon31:54 -
Posted by
David Cheney
on
Mar 24, 2025
How GitHub Copilot Serves 400 Million Completion Requests a Day
David Cheney explains the architecture powering GitHub Copilot, detailing how they achieve sub-200ms response times for millions of daily requests.
on Mar 24, 2025Icon49:24 -
Posted by
Ivan Burmistrov
on
Mar 20, 2025
The Harsh Reality of Building a Real-Time ML Feature Platform
Ivan Burmistrov shares how ShareChat built their own Real-Time Feature Platform serving more than 1 billion features per second, and how they managed to make it cost efficient.
on Mar 20, 2025Icon47:16 -
Posted by
Moumita Bhattacharya
on
Mar 17, 2025
Recommender and Search Ranking Systems in Large Scale Real World Applications
Moumita Bhattacharya overviews the industry search and recommendations systems, goes into modeling choices, data requirements and infrastructural requirements, while highlighting challenges.
on Mar 17, 2025Icon48:46 -
Posted by
Alana Marzoev
on
Mar 06, 2025
Powering User Experiences with Streaming Dataflow
Alana Marzoev discusses the fundamentals of streaming dataflow and the architecture of ReadySet, a streaming dataflow system designed specifically for operational workloads.
on Mar 06, 2025Icon53:21 -
Posted by
Shruti Bhat
on
Feb 28, 2025
Pioneering the Future: Advancing Infrastructure for AI Agents
AI agents, powered by RAG and vector databases, will anticipate needs, automate workflows, and supervise agents. This talk explores infrastructure, security, and impact to help enterprises harness AI.
on Feb 28, 2025Icon40:42 -
Posted by
Olalekan Elesin
on
Feb 26, 2025
Elevate Developer Experience with Generative AI Capabilities on AWS
Olalekan Elesin discusses how generative AI tools can improve productivity, streamline workflows, and foster a more efficient and effective development environment.
on Feb 26, 2025Icon40:06 -
Posted by
Hien Luu
on
Feb 21, 2025
Prompt Engineering: Is it a New Programming Language?
Hien Luu debates if prompt engineering is a programming language, arguing the case for both sides and exploring how this may impact learning and skill acquisition for software developers.
on Feb 21, 2025Icon43:43