Machine Learning Content on InfoQ
-
From "Simple" Fine-Tuning to Your Own Mixture of Expert Models Using Open-Source Models
Sebastiano Galazzo shares practical tips and mistakes in creating custom LLMs for cost-effective AI. Learn LoRA, merging, MoE & optimization.
-
How Green is Green: LLMs to Understand Climate Disclosure at Scale
Leo Browning explains the journey of developing a Retrieval Augmented Generation (RAG) system at a climate-focused startup.
-
LLM and Generative AI for Sensitive Data - Navigating Security, Responsibility, and Pitfalls in Highly Regulated Industries
Stefania Chaplin and Azhir Mahmood discuss responsible, secure, and explainable AI in regulated industries. Learn MLOps, legislation, and future trends.
-
Unleashing Llama's Potential: CPU-Based Fine-Tuning
Anil Rajput and Rema Hariharan detail CPU-based LLM (Llama) optimization strategies for performance and TCO reduction.
-
Navigating LLM Deployment: Tips, Tricks, and Techniques
Meryem Arik shares best practices for self-hosting LLMs in corporate environments, highlighting the importance of cost efficiency and performance optimization.
-
The Harsh Reality of Building a Real-Time ML Feature Platform
Ivan Burmistrov shares how ShareChat built its own real-time feature platform serving more than 1 billion features per second, and how the team made it cost-efficient.
-
Recommender and Search Ranking Systems in Large Scale Real World Applications
Moumita Bhattacharya gives an overview of industry search and recommendation systems, covering modeling choices, data requirements, and infrastructure requirements, while highlighting challenges.
-
Flawed ML Security: Mitigating Security Vulnerabilities in Data & Machine Learning Infrastructure with MLSecOps
Adrian Gonzalez-Martin introduces the motivations and the importance of security in data & ML infrastructure through a set of practical examples showcasing "Flawed Machine Learning Security".
-
Leveraging Open-source LLMs for Production
Andrey Cheptsov discusses the practical use of open-source LLMs for real-world applications, weighing their pros and cons, highlighting advantages like privacy and cost-efficiency.
-
Scale out Batch Inference with Ray
Cody Yu discusses how to build a scalable and efficient batch inference stack using Ray.
-
Why Most Machine Learning Projects Fail to Reach Production and How to Beat the Odds
Wenjie Zi discusses common pitfalls that cause machine learning projects to fail, such as the inherent uncertainty of machine learning, misaligned optimization objectives, and skill gaps among practitioners.
-
Navigating LLM Deployment: Tips, Tricks, and Techniques
Meryem Arik discusses best practices in model optimization, serving, and monitoring, with practical tips and real case studies.