0
Act 4

Mastery

8 / 10

FTI Architecture

Act 4 · ~5 min

Theory

FTI Architecture divides every ML system into exactly three pipeline types, each with a single responsibility, connected through versioned shared storage.

PipelineInputOutputStores to
FeatureRaw dataFeatures + labelsFeature store
TrainingFeature store snapshotTrained modelModel registry
InferenceFeature store + model registryPredictionsClient / DB
Raw Datacrawl · ingest
Feature Pipelineclean · chunk · embed
Feature StoreQdrant / versioned
Feature Storeread snapshot
Training PipelineSFT · DPO · merge
Model RegistryHuggingFace / versioned
Model Registryload model
Inference Pipelineretrieve · generate
ResponseAPI · batch
FTI data flow: feature store and model registry are the only cross-pipeline contracts.

Why it matters:

  • No training-serving skew — the feature pipeline writes to the store once; both training and inference read from that same store.
  • Reproducibility — the model registry records which feature-store version was used; any training run can be replayed exactly.
  • Isolation — each pipeline can use a different tech stack, scale independently, and be tested without touching the others.

LLMOps forward link: monitoring, drift detection, and automated retraining triggers are the operational layer that wraps FTI pipelines in production.