Retrieval-augmented generation is usually treated as a model feature. In production, it behaves more like a distributed system.
The hard parts are document quality, chunking strategy, permissions, evaluation, freshness, observability, and user trust.
The model matters. The engineering around the model determines whether the feature survives real use.