r/MachineLearning

ml. Beginners please see learnmachinelearning

Members

Online

•

Jman7762

[D] Monitoring and Debugging RAG Systems in Production

Discussion

Hi!

I’m part of a team from MIT, where we specialize in developing advanced tools for data visualizations of the latent space. We are currently exploring how visualizations can help increase the effectiveness of RAG monitoring systems and would love to gather insights from how people manage RAGs currently.

We know there are existing monitoring tools like Ragas, Arize(Phoenix), LangSmith. We are curious on how

Frequency you are looking at monitoring data
What does the end-user application your RAG support look like?

We believe that a visualization tool could greatly enhance the ability to monitor and debug RAG systems in real-time by:

Providing intuitive, graphical representations of system performance and behavior.
Highlighting potential issues and bottlenecks at a glance.

If you’re willing to share more detailed insights through an interview, please let us know! Happy to get connected and learn more!

Sort by:

Best

Open comment sort options

Best

Top

New

Controversial

Old

Q&A

newpeak

•

RAG is not a LLMOps problem currently, because it still has a lot of room for improvement in terms of relevance. Therefore, monitoring the pipeline is not helpful. When the effects of RAG is good enough, could the requirements of LLMOps emerged.

Solutions of taking RAG into an orchestration problem does belong to the so called RAG 1.0 which does not make sense. RAG 2.0 means an end-to-end solution which requires a series of components to work closely together including :

Excellent data chunking tools to recognize the semantics of unstructured data.
Query model or query rewrite operators because there are always semantic gap between questions and answers.
Good enough databases for retrieval. Within the predictable future, hyrbid search is always a MUST, such as Blended RAG even requires three-ways of hybrid search.
Ranking models which are tailord for the vertical scenarios.
Dynamic orchestration for Agentic RAG.

I think you can focus on any of the above as they have more pressing needs think you could take focuse on any of the above which could have much higher emergent. ps, we are striving to make the above happen through https://github.com/infiniflow/ragflow , based on which some further works could be done.

Eam404

•

u/newpeak - really great response.

More replies

aveho_adhuc_7409

•

RAG monitoring can be a black box, really interested in seeing what you develop!

Get the Reddit app

[D] Monitoring and Debugging RAG Systems in Production

More posts you may like

Third party apps, in Apple TestFlight, that supports iOS 18 Control Center:

[R] Trillion-Parameter Sequential Transducers for Generative Recommendations

Engineers fabricate a chip-free, wireless electronic “skin”

[D] Biggest Pain Points in Data Preparation and Cleaning?

[WP] Bitcoin is actually a tool created by an advanced AI to get humans to create faster and cheaper computers by exploiting their most predictable trait: Greed.

[D] ML Researchers in Industry: How Do You Find Time to Publish Papers?

Cauliflower stir fry - 145 calories!

Seeking feedback on features for better monitoring & troubleshooting Kafka

Microsoft will ship DirectX 12 alongside Windows 10 in 2015, the computing firm has announced.

[P] [D] Hi I'm a senior machine learning engineer, looking for for buddies to build cool stuff with!

Seeking Advice on Improving AI BGM Generation for Game Developers

[D]LLM interview Q&A

[D] How to network at a conference

[D] Using Models for Hypothesis Generation

[D] Current state of Chatbot pipelines in Commercial settings?

Looking for Feedback on Our Construction Management SaaS

[TASK] Looking for a Zapier expert to help streamline business processes

Anyone working on UNICEF projects that also uses Quickbooks to prepare FACE reports? I'd love to pick your brain!

[R] Lamini.AI introduces Memory Tuning: 95% LLM Accuracy, 10x Fewer Hallucinations

[D] François Chollet Announces New ARC Prize Challenge – Is It the Ultimate Test for AI Generalization?

Revamping Legacy Hospital System: Seeking Advice on Caching and Dockerizing Development Workflow

Seeking Advice on Improving AI bgm Generation for Game Developers and Content Creators

Top 5 automation tasks using zapier for video scriptwriting

[R] Are you a reviewer for NeurIPS'24? Please read this

[D] Feeling Lost in My ML Career: Advice Needed

Get the Reddit app

[D] Monitoring and Debugging RAG Systems in Production

More posts you may like

Third party apps, in Apple TestFlight, that supports iOS 18 Control Center:

[R] Trillion-Parameter Sequential Transducers for Generative Recommendations

Engineers fabricate a chip-free, wireless electronic “skin”

[D] Biggest Pain Points in Data Preparation and Cleaning?

[WP] Bitcoin is actually a tool created by an advanced AI to get humans to create faster and cheaper computers by exploiting their most predictable trait: Greed.

[D] ML Researchers in Industry: How Do You Find Time to Publish Papers?

Cauliflower stir fry - 145 calories!

Seeking feedback on features for better monitoring & troubleshooting Kafka

Microsoft will ship DirectX 12 alongside Windows 10 in 2015, the computing firm has announced.

[P] [D] Hi I'm a senior machine learning engineer, looking for for buddies to build cool stuff with!

Seeking Advice on Improving AI BGM Generation for Game Developers

[D]LLM interview Q&A

[D] How to network at a conference

[D] Using Models for Hypothesis Generation

[D] Current state of Chatbot pipelines in Commercial settings?

Looking for Feedback on Our Construction Management SaaS

[TASK] Looking for a Zapier expert to help streamline business processes

Anyone working on UNICEF projects that also uses Quickbooks to prepare FACE reports? I'd *love* to pick your brain!

[R] Lamini.AI introduces Memory Tuning: 95% LLM Accuracy, 10x Fewer Hallucinations

[D] François Chollet Announces New ARC Prize Challenge – Is It the Ultimate Test for AI Generalization?

Revamping Legacy Hospital System: Seeking Advice on Caching and Dockerizing Development Workflow

Seeking Advice on Improving AI bgm Generation for Game Developers and Content Creators

Top 5 automation tasks using zapier for video scriptwriting

[R] Are you a reviewer for NeurIPS'24? Please read this

[D] Feeling Lost in My ML Career: Advice Needed

Anyone working on UNICEF projects that also uses Quickbooks to prepare FACE reports? I'd love to pick your brain!