$94K - $156K a year
Evaluate AI-generated software engineering outputs, create test scenarios, provide detailed feedback, and collaborate with teams on AI system improvements.
Bachelor's degree and professional software engineering experience with high attention to detail and ability to commit 10+ hours weekly.
We’re supporting an AI innovation initiative seeking professional software engineers to help evaluate and improve software engineering-focused AI training data. In this role, you’ll apply your engineering expertise, systems thinking, and analytical skills to assess AI model outputs, test complex scenarios, and provide detailed feedback on code, architecture, and system behavior.

This is a meaningful opportunity for software engineers interested in AI, research, and shaping next-generation generative AI systems. The role is fully remote, offers flexible scheduling, and allows you to directly impact AI systems used across technical domains.

What You’ll Do
• Create and refine software engineering-focused prompts and test scenarios for AI models
• Evaluate AI-generated responses using structured rubrics and engineering frameworks
• Review outputs for correctness, logical consistency, and real-world technical applicability
• Identify errors, inefficiencies, or risks and provide actionable feedback
• Collaborate with project teams and participate in weekly sync meetings
• Contribute to the development and refinement of evaluation rubrics

Ideal Candidate Profile
• Strong background in software engineering, system architecture, or technical problem-solving
• Precision-focused reviewer who can analyze complex technical outputs
• Comfortable adapting to changing workloads and priorities
• Skilled written communicator who explains technical concepts clearly
• Experience working with LLMs or AI evaluation is a plus, but not required

Required Qualifications
• Bachelor’s degree or higher in Software Engineering, Computer Science, or related field
• Professional experience in software engineering or technical systems analysis
• High attention to detail and consistency in evaluations
• Ability to commit at least 10 hours/week with flexible scheduling
• U.S.-based and authorized to work in the United States

Preferred Qualifications
• Experience working with AI, machine learning, or digital content evaluation
• Familiarity with coding, debugging, or systems architecture
• Interest in AI ethics, responsible innovation, or the future of technical AI applications

Compensation & Schedule
• Task-based pay: $15 per accepted task
• Expect 3–6 tasks per hour (~$45–75/hr)
• Variable weekly hours (average 5–20 hours; may peak at 40 based on workload)
• Fully remote (U.S.-only)
• 10-week contract

About Ami Arroyo Recruiting
Ami Arroyo Recruiting is committed to supporting candidates with transparency, respect, and personalized guidance. We connect exceptional talent with innovative teams, ensuring every candidate feels supported throughout the hiring process. We’re an Equal Opportunity Employer and welcome applicants from all backgrounds who are eligible to work in the U.S.

Pay: $45.00 - $75.00 per hour
Work Location: Remote
This job posting was last updated on 12/1/2025