Generative AI Monitoring and Reliability Manager

Oversee system health, troubleshoot anomalies, and implement reliability benchmarks to ensure dependable AI tools for education.

Remote

Part time

About Reality AI Lab

Reality AI Lab is advancing open-source AI tools that empower global education and career growth. Our mission is to develop AI Agents that support educators and learners worldwide, beginning with Marvel AI (an AI Teaching Assistant) and Sky AI (an AI Career Coach). Our tools are designed to make education more accessible and provide career-focused solutions that help people thrive.

Role Overview

We are seeking a Generative AI Monitoring and Reliability Manager to oversee and ensure the performance, reliability, and stability of our Generative AI models. In this role, you’ll be responsible for implementing monitoring processes, detecting system anomalies, and continuously improving the reliability of our AI tools. This position is ideal for someone with experience in AI system monitoring, performance optimization, or reliability engineering, who is passionate about maintaining high-quality and dependable AI solutions.

Key Responsibilities

System Monitoring and Performance Management: Oversee the operational health of Generative AI models, tracking performance indicators to ensure consistent and stable functionality across various educational applications.
Anomaly Detection and Troubleshooting: Develop and implement processes for identifying and resolving deviations, bugs, or unusual system behaviors, working quickly to address issues to minimize user impact.
Reliability and Uptime Optimization: Set and maintain benchmarks for uptime and reliability, establishing protocols that maximize system stability and ensure consistent access for users.
Continuous Improvement: Collaborate with developers and data scientists to implement feedback loops and enhance model reliability based on monitoring insights, performance data, and user feedback.
Documentation and Reporting: Maintain comprehensive records of monitoring practices, system performance metrics, and troubleshooting protocols, providing regular reliability reports to stakeholders.

Requirements

Experience in AI System Monitoring or Reliability Engineering: Background in system monitoring, reliability engineering, or operations, preferably within AI or machine learning environments.
Proficiency with Monitoring Tools: Familiarity with monitoring and reliability tools such as Grafana, Prometheus, or similar platforms to track AI performance metrics effectively.
Strong Analytical and Troubleshooting Skills: Skilled in identifying, analyzing, and resolving performance issues within complex AI systems, with a focus on enhancing reliability and uptime.
Attention to Detail: High level of attention to system accuracy, with an ability to detect subtle performance changes or deviations that could impact user experience.
Communication and Documentation Skills: Ability to document procedures, create detailed reports, and communicate insights effectively to development and operations teams.
Commitment to Continuous Improvement: Passion for refining monitoring practices, ensuring the best possible AI system performance, and supporting the educational mission of Reality AI Lab.

Additional Information

Commitment: Part-time, unpaid open-source contribution role with flexible scheduling.
Duration: 1-6 months, remote work setup.
Diversity and Inclusion: Reality AI Lab is an equal opportunity organization, committed to fostering a diverse and inclusive environment.

‍

Apply now

Stay Connected for AI Lab Career Opportunities

Be the first to know about new roles and exciting opportunities at Reality AI Lab.



You're all set! 🎉 Thank you for subscribing to Reality AI Lab career updates.

Oops! Something went wrong. 😔 Please check your email address and try again. If the issue persists, feel free to contact us for assistance.

More open position

View all roles

AI Product Development Manager



Lead the development of innovative AI tools, aligning strategies with Reality AI Lab's mission to empower education globally.

Remote

Part time



AI Product Strategy Manager



Drive strategic vision for AI tools, aligning products with market needs and advancing open-source education innovation.

Remote

Part time



AI Project Manager



Coordinate AI project planning and execution, fostering collaboration and ensuring timely delivery within an open-source environment.

Remote

Part time



AI Program Manager



Oversee multiple AI projects to align with strategic goals, driving impact through effective program coordination and collaboration.

Remote

Part time



People Operations Manager



Foster a thriving, inclusive contributor community by managing relations, onboarding, and support processes in open-source projects.

Remote

Part time



Learning and Development Manager



Empower contributors with learning programs that build skills and foster collaboration within the open-source community.

Remote

Part time



Community Engagement Manager



Build a vibrant, collaborative open-source community by supporting contributors and fostering meaningful engagement.

Remote

Part time



Community Moderation Manager



Ensure respectful and constructive community interactions by enforcing guidelines and fostering inclusivity within open-source projects.

Remote

Part time



Contributor Experience Manager



Enhance contributor experiences with streamlined onboarding, support programs, and recognition initiatives in open-source projects.

Remote

Part time



Open Source Talent Acquisition Manager



Drive community growth by recruiting passionate contributors for open-source AI projects. Shape a diverse, inclusive AI ecosystem.

Remote

Part time



Open Source Talent Onboarding Manager



Create seamless onboarding experiences for new contributors to ensure their success in Reality AI Lab's open-source projects.

Remote

Part time



Grant and Funding Manager



Secure grants and funding to advance Reality AI Lab's open-source AI tools and educational innovations.

Remote

Part time



Corporate Partnerships Manager



Build strategic corporate partnerships to support Reality AI Lab's mission and secure resources for open-source growth.

Remote

Part time



Academic Partnerships Manager



Collaborate with academic institutions to foster partnerships, research, and engagement for Reality AI Lab's open-source projects.

Remote

Part time



Product Marketing Manager



Craft and execute marketing strategies for Reality AI Lab's AI tools, connecting with global educators and contributors.

Remote

Part time



Social Media Manager



Amplify Reality AI Lab's mission and projects through engaging social media content, fostering a vibrant online community.

Remote

Part time



AI Educational Content Alignment Manager



Align generative AI tools with curriculum standards to enhance educational outcomes and empower teachers.

Remote

Part time



AI Education Quality Manager



Ensure high-quality, curriculum-aligned educational content for Marvel AI to support effective teaching and learning.

Remote

Part time



AI Educational Content Factuality Manager



Ensure the accuracy, credibility, and neutrality of AI-generated educational content for Marvel AI, maintaining high standards of reliability.

Remote

Part time



AI Education Privacy and Compliance Manager



Oversee data privacy and compliance for Marvel AI, ensuring adherence to regulations and protecting user data in educational settings.

Remote

Part time



Generative AI Ops Manager



Manage deployment, infrastructure, and workflows for Generative AI models, ensuring efficient and scalable operations for AI tools.

Remote

Part time



Generative AI Safety Manager



Ensure the ethical, safe, and compliant use of Generative AI models, focusing on mitigating risks and fostering responsible AI practices.

Remote

Part time



Generative AI Training Manager



Develop training programs to teach users and contributors how to effectively and responsibly use Generative AI tools in education.

Remote

Part time



Generative AI Evaluation Manager



Lead the evaluation of Generative AI models, assessing accuracy and quality to ensure they meet educational and user-focused standards.

Remote

Part time



Cybersecurity Manager



Oversee cybersecurity for open-source AI tools, ensuring data integrity, privacy, and protection against vulnerabilities.

Remote

Part time



Data Governance Manager



Lead data governance for open-source AI projects, ensuring quality, privacy, and compliance with relevant standards.

Remote

Part time



Product QA Manager



Manage QA processes for AI tools, ensuring reliability and usability through rigorous testing and quality assurance strategies.

Remote

Part time



Product DevOps Manager



Streamline infrastructure and CI/CD workflows for scalable and reliable open-source AI tools, ensuring seamless deployment.

Remote

Part time

