Position Description
. Proven experience in testing AI/ML-powered applications, with a strong understanding of how to validate non-deterministic systems.
. Hands-on experience with LLM evaluation and testing tools such as Ragas, Deepchecks, OpenAI Evals, or DeepEval is highly desirable.
. Expertise in creating test plans, designing test cases, and executing functional, performance, and regression tests for AI models, focusing on bias, accuracy, and robustness.
. Strong proficiency in Python and test frameworks such as pytest for building automated test suites.
. Solid experience testing RESTful APIs and backend services to verify correct data flow and integration between components.
. Familiarity with CI/CD tooling (e.g., Jenkins), container orchestration (Kubernetes), and cloud platforms (GCP, AWS) is a major plus.
. A methodical approach to identifying, documenting, and tracking bugs in complex AI systems.