Overview
QCon San Francisco
Schedule
Reception - Dry Snacks & Drinks - Ballroom Foyer & Pacific Level - Ends at 6:30pm
Continental Breakfast - Pacific Level
Organizing for Fast Flow: A Practitioner's View on Implementing Team Topologies
[SOLD OUT] Building Resilient Asynchronous and Event-Driven Systems
Building Intelligent Java Applications: A Developer’s Workshop
Chat with your website using an LLM and open stack (Allycat)
From Friction to Flow: How Great DevEx Makes Everything Awesome
Conference Introduction and Keynote: Open Source, Community, and Consequence: The Story of MongoDB
Break
Architecting for Efficiency: Performance, Cost, and Maintainability
The Stories Behind the Incidents
Empowering Teams with AI: Productivity and the Future of Software Development
High-Performance Languages in Modern Development
Sponsored Solution Track III
How to Build an Exchange: Sub Millisecond Response Times and 24/7 Uptimes in the Cloud
These days it is possible to achieve fairly good performance on cloud-provisioned systems. We discuss the design of a high-performance, strongly consistent system that maintains constant service in the face of regular updates to core logic.
Continuous Delivery for Foundational Platforms
Platform teams frequently inherit systems that were never architected for their current scale, yet are so foundational that downtime can halt the business.
Designing AI Platforms for Reliability: Tools for Certainty, Agents for Discovery
Modern AI platforms don’t have to choose between deterministic precision and probabilistic exploration—they need both.
From Founding Engineer to CTO to CEO – At the Same Startup
As a college sophomore, I founded Pointz, initially building its core routing engine from scratch. I architected the platform’s safety rating algorithm, integrating geospatial data structures, crowdsourced inputs, and multi-criteria decision-making to deliver context-aware routing.
The Future of Agentic AI: Architecting the Global Control Plane
The agentic AI landscape is facing an architectural crisis due to framework fragmentation, bottlenecking the creation of truly reliable autonomous systems.
The Rust High Performance Talk You Did Not Expect
Rust runs faster, but it slows down engineers, right? This was our team’s assumption when we decided to rewrite our code from Kotlin into Rust. But we were wrong in completely unexpected ways.
Producing the World's Cheapest Tokens: A How-to Guide
AI inference is expensive, but it doesn’t have to be. In this talk, we’ll break down how to systematically drive down the cost per token across different types of AI workloads.
The Human Toll of Incidents & Ways To Mitigate It
Have you ever wondered what it's like to respond to a significant incident? Walk through an hour-by-hour reconstruction of an incident response or two, focusing on what it was like to be "in the room" and the human response to the incidents.
Choosing Your AI Copilot: Maximizing Developer Productivity
The AI coding agent landscape evolves weekly. This talk compares today’s frontrunners, shows where each shines, and shares prompts, policies, and “rules templates” that turn code suggestions into production-quality output.
Building Trustworthy Agentic Systems with Workflow Orchestration
As engineering teams explore building AI agents, one challenge becomes clear: agents don’t live in isolation, and trust in agentic systems is often lacking.
Break
Break & Attendee Lightning Talks (Pacific C)
Compiling Workflows into Databases: The Architecture That Shouldn't Work (But Does)
What if everything you know about building distributed systems is backwards?
Beyond Line Charts: Why Some Diversity in Telemetry Visualization Is Long Overdue
For decades, visualization of service metrics has overwhelmingly converged on line charts. The time-centric nature of real-time telemetry further cemented this pattern via storage layouts and domain-specific query languages.
Confidently Automating Changes Across a Diverse Fleet
Maintaining up-to-date and secure software across a polyglot fleet is a challenge for any engineering organization. Manual migrations and urgent updates disrupt productivity and require coordination across many teams.
From Staff Platform Engineer to a 16z Founder: What I Wish I'd Known
The Technical Leadership Challenge: Every successful engineering team faces the same inflection point: transforming beloved internal tools into a legitimate platform business.
Shift Left with Observability and AI Driven Development
Developers today are not just writing code, they’re orchestrating complex systems, managing pipelines, and maintaining reliability at scale. But when observability starts too late in the lifecycle, teams are left reacting instead of building.
How Netflix Shapes our Fleet for Efficiency and Reliability
Netflix runs on a complex multi-layer cloud architecture made up of thousands of services, caches, and databases. As hardware options, workload patterns, cost dynamics and the Netflix products evolve, the cost-optimal hardware and configuration for running our services is constantly changing.
When Incidents Refuse to End
As engineers, we’re used to managing failure, but long-running outages hit differently. They stretch teams, systems, and assumptions about how incidents “should” play out.
Accelerating LLM-Driven Developer Productivity at Zoox
Over the past year, Zoox has invested in integrating Large Language Models (LLMs) into internal developer workflows through a company-wide initiative called Zoox Intelligence (ZI).
Instrumentation at Scale: Having Your Performance Cake and Eating It Too
In high-performance code, a single misplaced counter increment can cost more than the operation it’s measuring. That creates a paradox: instrument too much and you slow the system down; instrument too little and you miss the insights you need to continuously deliver.
Beyond Vector Search: Building Ultra-Resilient AI with Distributed SQL
As enterprises seek to move AI from proof-of-concept to production, standalone vector databases face limits in synchronization, ACID compliance, and resilience. This session shows how PostgreSQL-compatible distributed databases address these issues while keeping a familiar developer experience.
Lunch & Lunch Topic Discussions on Atrium Level.
Lunch
Women & Allies in Tech Lunch Session
Leading and Analyzing Operational Incidents From Chaos to Resolution
Fundamentals of Software Engineering in the Age of AI
Open Source RAG Pipeline With Docling + Data Prep Kit + Milvus + Open LLMs
From Kubernetes-Native to AI-Powered: Java With LangChain4j
Parting the Clouds: The Rise of Disaggregated Systems
Cloud systems are undergoing an architectural shift. Traditional shared-nothing designs struggle to deliver the elasticity, availability, and operational simplicity that the cloud demands.
Microservices Platforms: When Team Topologies Meets Microservices Patterns
When many teams work on a large, complex application, the microservice architecture potentially enables them to work independently and deliver a continuous stream of changes.
Secure Software Supply Chain: Risk Prediction at the Speed of Development
The Platform That Sees Risk Before Code Does
Maximizing Success with Limited Time, Resources, and Energy: Lessons from Startup Engineering
Startups are the harshest environments for engineers. Limited time, resources, and energy force teams to make decisions under pressure — and those decisions can determine whether a company survives or stalls.
AI won't fix developer productivity (unless you fix context first)
AI coding tools promise productivity gains, but many teams aren’t seeing the impact.
Realtime and Batch Processing of GPU Workloads
SS&C Technologies runs 47 trillion dollars of assets on our global private cloud. We have the primitives for infrastructure as well as platforms as a service like Kubernetes, Kafka, NiFi, Databases, etc.
The Ironies of A^2 I^2
In this talk, we'll explore some of the "ironies" of automation (and now, artificial intelligence) in their interactions with software operators (i.e., you), especially during high-consequence, high-tempo situations (aka incidents).
Powering the Future: Building Your GenAI Infrastructure Stack
Behind every productivity leap is a rock-solid platform. Go under the hood with Intuit’s GenOS team to see how vector stores, prompt management, RAG pipelines, and agent orchestration come together to serve ~100 million users.
When Every Bit Counts: How Valkey Rebuilt Its Hashtable for Modern Hardware
Ever wondered what happens when a bunch of performance-obsessed developers decide their blazing-fast database isn't quite blazing-fast enough?
Sponsored presentation powered by Pure Storage
Details coming soon!
Break
Break & Attendee Lightning Talks (Pacific C)
Keeping the Mainline Green Across Diverse Language Monorepos
At Uber’s scale, ensuring an always-green mainline while processing hundreds of changes per hour is a massive challenge, especially when those changes span multiple language monorepos supporting dozens of business-critical apps.
Building Resilient Platforms: Insights from 20+ Years in Mission-Critical Infrastructure
In this talk, Matthew will describe lessons learned from more than 20 years of building scalable, secure, and stable infrastructure platforms for software in financial services (electronic trading, credit card processing, etc.). The talk is relevant to anyone building platforms for mission-critic
Unconference: Modern Platform Engineering and Dev Enablement
The Future of Engineering: Mindsets That Matter When Code Isn’t Enough
Since the first compiler, software has been a stack of human-friendly abstractions translated into machine instructions. The engineers who understood at least some of what was going on under the hood were essential—indispensable even. But now?
AI Native Architecture for Java Applications
We are currently moving from "AI-enabled" systems, where artificial intelligence is an additive feature, to "AI-native" systems, where intelligence is the foundational, architectural core.
From ms to µs: OSS Valkey Architecture Patterns for Modern AI
As AI applications demand faster and more intelligent data access, traditional caching strategies are hitting performance and reliability limits.
Week-Long Outage: Lifelong Lessons
Routine database upgrades should be straightforward, especially with familiar, well-established technology. We were confident heading into our Elasticsearch upgrade, equipped with a solid plan and excited to see performance gains like we had seen from past upgrades.
AI-Driven Productivity: From Idea to Impact
In this session you'll learn how product leaders turn GenAI enthusiasm into an enterprise-ready blueprint for real productivity gains.
Accelerating Performance by Incrementally Integrating Rust Into Existing Codebase
In order to improve the performance of existing applications and services, we can identify the most performance-critical pieces and reimplement them in Rust as opposed to completely rewriting the applications from scratch.
From Events to Actions: Building Apps That Respond to Users Instantly
Most engineering talks start with buzzwords like “AI” and “personalization.” This one starts with a question: How do you implement in-session actions in your application?
Coffee Break & Snacks - Pacific Level
Architecting a Centralized Platform for Data Deletion at Netflix
What does it take to safely delete data at Netflix scale? In large-scale systems, data deletion cuts across infrastructure, reliability, and performance complexities.
Platform Engineering: Lessons from the Rise and Fall of eBay Velocity
Once a stock market darling and a pioneering hyperscaler in the 1990s and early 2000s, eBay has been in steady decline since the 2010s. A household name with a flat business, eBay has been unable to make substantive strides in its market reach or its engineering outcomes in the last 15 years.
Rust at the Core - Accelerating Polyglot SDK Development
Developing SDKs for your users in multiple languages can come at a high cost, especially if you need to implement complex logic on the client side. Traditionally, options for sharing logic across those languages have been quite limited.
Unconference: Early Stage Startups
Building Zero-CVE Container Images at Scale: Patterns and Pitfalls
Every team wants secure container images, but few realize how complex it becomes at scale.
Write-Ahead Intent Log: A Foundation for Efficient CDC at Scale
As companies grow, so does the complexity of keeping distributed systems in sync. At DoorDash, we tackled this challenge while building a high-throughput, domain-oriented data platform for capturing changes across hundreds of services.
The Time it Wasn't DNS
In January of 2023, the Microsoft Azure Wide Area Network experienced a global outage. If you were a Microsoft customer at the time, you were impacted by this outage.
Trustworthy Productivity: Securing AI-Accelerated Development
Autonomous agents operate in a continuous loop: perceive context → reason → execute tools → observe. Each edge creates distinct attack surfaces. This talk maps vulnerabilities—memory poisoning in context ingestion, goal hijacking during reasoning, blind execution at the action stage.
Python, Numba, and Algorithm Design: Building Efficient Models in Financial Services
The popularity of Python means insurance and financial services companies have a growing body of actuaries, quantitative developers, and software engineers capable of building innovative and customized solutions for both data management and modeling.
Beyond Code: Building a Personal Brand To Boost Your Career
In an increasingly competitive field, software expertise alone may not be enough to stand out and drive your career forward.
Break
QCon Closing Reception - Dry Snacks & Drinks - Ballroom Foyer
Unconference: Architectures You've Always Wondered About
Enhancing Reliability Using Service-Level Prioritized Load Shedding at Netflix
How does Netflix maintain a seamless viewing experience for millions of users, especially during traffic spikes or when backend datastores are overloaded? Autoscaling can help during traffic spikes, but it costs money, takes a few minutes to kick in, and capacity may not always be available.
Directing a Swarm of Agents for Fun and Profit
Coding agents are a new tool, which many of us are trying to figure out how to use effectively.
Beyond the Build: Early-Stage Mindset for the Everything Engineer
In an early-stage startup, the lines between roles blur, and engineering can become less about building and more about what makes the company attractive.