Kicking Off Intelligent Observability for Seam: My OSRE 2025 Journey

Hi! I'm Manish K Reddy (@kredd2506), a graduate student based in the United States, and I'm excited to join the OSRE 2025 cohort. This summer, I'll be working with the San Diego Supercomputer Center (SDSC) and the National Research Platform (NRP) on a project that blends my interests in machine learning, cloud systems, and real-world impact.

The National Research Platform (NRP) has moved beyond its original vision as a "ScienceDMZ data freeway" and evolved into a distributed cloud supercomputer, empowering research and education across more than 50 institutions. SDSC, located at UC San Diego, is recognized internationally for driving innovation in data, supercomputing, and advanced cyberinfrastructure.

🎯 My Project: Intelligent Observability for Seam

My project, "Intelligent Observability for Seam: A GenAI Approach," focuses on building an ML-powered service for NRP. The goal is to analyze monitoring data (starting with Prometheus metrics), automatically detect anomalies, and use generative AI (GenAI) to produce human-readable explanations and root-cause analysis. This will help researchers and operators solve problems faster and keep complex research systems running smoothly.
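To make the anomaly-detection step concrete, here is a minimal sketch of one common baseline: a trailing-window z-score over a single metric series. This is only an illustration, not the approach the project has settled on. The function name `detect_anomalies`, the `window` and `threshold` defaults, and the sample CPU values are all hypothetical, and a real pipeline would read its samples from Prometheus rather than a hard-coded list.

```python
from statistics import mean, stdev


def detect_anomalies(values, window=5, threshold=3.0):
    """Return indices of points that deviate from the trailing window.

    A point is flagged when it lies more than `threshold` sample standard
    deviations from the mean of the `window` points before it.
    """
    anomalies = []
    for i in range(window, len(values)):
        history = values[i - window:i]
        mu = mean(history)
        sigma = stdev(history)
        if sigma == 0:
            # Flat window: the z-score is undefined, so skip this point.
            continue
        if abs(values[i] - mu) / sigma > threshold:
            anomalies.append(i)
    return anomalies


# Hypothetical CPU-usage samples (fractions of one core), with one spike.
cpu_usage = [0.41, 0.43, 0.40, 0.42, 0.44, 0.41, 0.95, 0.42, 0.43]
print(detect_anomalies(cpu_usage))  # prints [6]
```

A baseline like this is cheap to run and easy to reason about, which makes it a useful reference point before moving to learned models. Note that the spike inflates the window statistics for the points that follow it, so a production detector would likely need a more robust estimator.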

I am especially grateful to my lead mentor Mohammad Firas Sada, who is personally guiding me throughout this project. I also want to thank Jeffrey Weekley and Derek Weitzel for their support and guidance.

📋 Project Specifications

GenAI-Driven Observability for NRP

  • Topics: Machine Learning, Observability, DevOps, High Performance Computing, LLMs, GenAI, Distributed Systems
  • Skills: Python, Prometheus, Docker, Kubernetes, FastAPI, PyTorch, Pandas, LLM APIs, scikit-learn, PostgreSQL
  • Difficulty: Medium
  • Size: 350 hours
  • Mentors: Mohammad Firas Sada, Jeffrey Weekley, Derek Weitzel

🚀 What I'm Looking Forward To

This summer, I'm excited about:

  • Delivering an open-source anomaly detection tool for NRP
  • Building GenAI features for better explanations and root-cause analysis
  • Learning from my mentors and contributing to a vibrant open science community

🔑 Key Takeaways

Bridging AI and Infrastructure

This project sits at the intersection of my interests in machine learning and cloud systems, and it lets me build tools that have a real-world impact on research infrastructure.

Community-Driven Development

Working with NRP and SDSC provides an opportunity to contribute to open science and support researchers across 50+ institutions with better observability tools.

🚀 What's Next?

In the coming weeks, I'll be diving deep into NRP's monitoring infrastructure, setting up the initial ML pipelines, and beginning work on the anomaly detection algorithms. Stay tuned for progress updates and technical deep dives!