Freelance Agent Evaluation Engineer

Mindrift

Qatar · Freelance

Experience
5+ years
Salary
USD 50 / hour
Openings
1
Posted
2 weeks ago
Work mode
On-site
Qualifications
Professionals with at least 5 years of software development experience and proficiency in Python, JavaScript/TypeScript, Docker, Postgres, Kafka, and Redis. Candidates must also have experience in writing tests and possess B2+ English proficiency.
Resume
Required to apply

Job Description

About Mindrift

Mindrift specializes in connecting skilled professionals with project-based opportunities in artificial intelligence, focusing on the testing, evaluation, and enhancement of AI systems for prominent technology firms. Participation is structured around specific projects rather than permanent employment.

Project Overview: AI Coding Agent Evaluation

This project involves the creation of a comprehensive dataset designed to assess the capabilities of AI coding agents. The goal is to determine how effectively these agents can handle authentic developer tasks.

Key Responsibilities

  • Construct realistic developer environments, simulating a virtual company with a complete codebase, necessary infrastructure, and contextual information (including tickets, documentation, and communications) to establish a credible development history.
  • Develop challenging tasks and define precise evaluation criteria within these simulated environments. This includes crafting effective prompts and establishing clear definitions of what constitutes a
