retrieval-testing
LangChain integration for LongProbe — sub-second RAG retrieval regression testing with chunk-level diffing