From Research Models to SLAs: Operationalizing TSFMs with Python

Jeyashree Krishnan, Catarina Filipe

MLOps & DevOps
Python Skill Intermediate
Domain Expertise Advanced

Motivation

Time series foundation models promise rapid prototyping and strong performance across domains, but many teams struggle to move beyond notebooks and benchmarks. In practice, the hardest problems are not model accuracy or architecture, but integration, operability, and developer experience.

This talk addresses a common but under-discussed question:

How do you operationalize time series foundation models inside a large organization with real users, real constraints, and real SLAs?

Case study context

The talk is based on hands-on experience building and operating Siemens KPI Forecast, a Python-based forecasting platform that exposes multiple TSFMs through stable APIs. The platform integrates:

Chronos, Lag-Llama, and TimesFM are open-source research models, while GTT is a proprietary Siemens model. The platform is designed to treat both open and closed-source models uniformly from a developer and user perspective.

Topics covered

  • Why TSFMs are easy to prototype but hard to operationalize
  • Designing Python APIs that unify multiple foundation models
  • Supporting zero-shot inference, fine-tuning jobs, and fine-tuned inference in one system
  • Integrating open-source and proprietary models consistently
  • Making forecasting services accessible to different user personas
  • Operating ML services with SLAs in a B2B environment
  • Monitoring, versioning, and governance considerations

What attendees will learn

  • How to structure Python services around foundation models
  • How to avoid fragmentation when supporting multiple models and workflows
  • Practical MLOps patterns for operating ML services beyond notebooks
  • Lessons learned from running TSFMs at organizational scale

This session focuses on engineering and operational lessons that are broadly applicable to teams building Python-based ML platforms in both enterprise and open-source contexts. Model references are included for transparency; the talk focuses on system design and operational patterns rather than proprietary details.

Jeyashree Krishnan

Jeyashree Krishnan is a Senior Machine Learning Engineer at Siemens AG. Her work focuses on building and operationalizing scalable machine learning services, with an emphasis on foundation models and time series forecasting. She is also a Visiting Researcher at the Center for Computational Life Sciences, RWTH Aachen University.

Catarina Filipe