Language Model Evaluation Intern (Remote) Job at Outlier AI, Arlington, TX

WmhPOGsvcmNqSGUxVUZ5MWtsOVM5WTk5VXc9PQ==
  • Outlier AI
  • Arlington, TX

Job Description

Join a global community of talented professionals to shape the future of AI. Earn up to $15 USD/hr and additional rewards based on quality of submission.

Outlier is committed to improving the intelligence & safety of AI models. Owned and operated by Scale AI , we’ve recently been featured in Forbes for partnering experts with top AI labs to provide the high quality data for LLMs. We believe AI can only perform as well as the data it’s trained on. That’s why we work with contributors from all over the world , who help improve AI models by providing expert human feedback . This data has led to AI advancements for the world's leading AI labs and large language model builders.

We’ve built a best-in-class remote work platform for our freelance contributors to provide valuable, specialized skills, and we in turn strive to provide them with a positive experience based on our core pillars of reliability, transparency, and flexibility.

What you will be doing

We are looking for someone who speaks fluent English to contribute their expertise toward training and refining cutting-edge AI systems.

  • Adopt a “user mindset” to produce natural data to meet the realistic needs you have or would use AI for.
  • Use the tool of rubrics to address user needs in a structured way.
  • Evaluate AI outputs by reviewing and ranking reasoning and problem-solving responses from large language models.
  • Contribute across projects depending on your specific skillset and experience.

What we’re looking for

  • Education : Bachelor’s degree or higher (or currently enrolled).
  • Analytical and Problem-Solving Skills : Ability to develop complex, professional-level prompts and evaluate nuanced AI reasoning.
  • Strong Writing : Clear, concise, and engaging writing to explain decisions or critique responses.
  • Attention to Detail : Commitment to accuracy and ability to assess technical aspects of model outputs.

    Nice to Haves:
  • Experience in fields like literature, creative writing, history, philosophy, theology, etc.
  • Prior writing or editorial experience (content strategist, technical writer, editor, etc.).
  • Interest or background in AI, machine learning, or creative tech tools.

Compensation and benefits

Earn up to $15 USD/hr, paid out weekly

Rates vary based on quality, accuracy, and time spent. Paid via PayPal & AirTM

Free access to
Model Playground

Interact, experiment and engage with leading large language models free of cost

Flexible schedule and
time commitment

No contracts, no 9-to-5. You control your schedule. (Most experts spend 5-10 hours/week, up to 40 hours working from home

Join a global community of
Coding experts

Join a global network of experts contributing to advanced AI tools

Disclaimer: For non-core work, such as during initial project onboarding or project overtime phases, lower rates may apply. Certain projects offer incentive payments. Please review the payment terms for each project.

Equal Opportunity Employer: Outlier is committed to fostering a diverse and inclusive work environment. We welcome applicants from all backgrounds and celebrate diversity in our workforce.

Job Tags

Weekly pay, Freelance, Internship, Remote work, Work from home, Flexible hours,

Similar Jobs

M&D Distributors Inc

Inside Sales Representative Job at M&D Distributors Inc

 ..., sensors and other engine parts. Our strong remanufacturing roots combined with our 48 branch locations, a nationwide outside sales team of 25 and our close OEM & Manufacturer partnerships make us unique in the industry no one understands diesel engine failure analysis... 

Talent Edge Recruiting

Marketing & Brand Manager Job at Talent Edge Recruiting

 ...Job Description Marketing & Brand Manager Onsite | Chicago, IL Full-Time | $90,000$100,000 A high-growth commercial real estate investment firm is seeking a Marketing & Brand Manager to lead company-wide marketing initiatives, elevate brand presence, and... 

Merican Inc

UiPath Automation Technical Lead Job at Merican Inc

UiPath Automation Technical Lead Remote Automation implementation on Payer business UiPath based automation implementation Artificial Intelligence (AI)

North Slope Borough

Museum Curator Job at North Slope Borough

Manage all NSB museum collections, supervise museum staff, act as building manager, participate in writing of grant proposals and administer resulting projects, and participate in both short and long-term planning for museum exhibits and educational programs. This position... 

3MG Roofing & Solar

Welder/Fabricator - Warehouse Lead Job at 3MG Roofing & Solar

 ...Summary: We are seeking a results-driven Welder/Fabricator Warehouse Lead to join our team. This role oversees all aspects of day-...  .... Oversee daily fabrication operations, including production scheduling, workflow, and resource allocation. Ensure compliance...