
Remote
Contract
India
About the Role
Sr. SW Engineer – Ruby (LLM Evaluation & Repository Validation)
About the projects: we are building LLM evaluation and training datasets to train LLM to work on realistic software engineering problems. One of our approaches, in this project, is to build verifiable SWE tasks based on public repository histories in a synthetic approach with human-in-the-loop; while expanding the dataset coverage to different types of tasks in terms of programming language, difficulty level, and etc.
About the Role: We are looking for experienced software engineers (tech lead level) who are familiar with high-quality public GitHub repositories and can contribute to this project. This role involves hands-on software engineering work, including development environment automation, issue triaging, and evaluating test coverage and quality
What does day-to-day look like:
Analyze and triage GitHub issues across trending open-source libraries.
Set up and configure code repositories, including Dockerization and environment setup.
Evaluating unit test coverage and quality.
Modify and run codebases locally to assess LLM performance in bug-fixing scenarios.
Collaborate with researchers to design and identify repositories and issues that are challenging for LLMs.
Opportunities to lead a team of junior engineers to collaborate on projects.
Required Skills:
Minimum 5+ years of overall experience
Strong experience with at least one of the following languages: Ruby
Proficiency with Git, Docker, and basic software pipeline setup.
Ability to understand and navigate complex codebases.
Comfortable running, modifying, and testing real-world projects locally.
Experience contributing to or evaluating open-source projects is a plus.
Nice to Have:
Previous participation in LLM research or evaluation projects.
Experience building or testing developer tools or automation agents.
Mandatory Skills
3 - 4+ years of relevant software development experience.
Ruby - min 3+ yrs exp
This is a short term remote contract opportunity that requires working at least 20 to 40 hours (part time/ full time) in week the US Pacific Time Zone. If this role suits you, please email your resume to [email protected]. kindly mention your current CTC, expected CTC and Notice Period in your email
About the projects: we are building LLM evaluation and training datasets to train LLM to work on realistic software engineering problems. One of our approaches, in this project, is to build verifiable SWE tasks based on public repository histories in a synthetic approach with human-in-the-loop; while expanding the dataset coverage to different types of tasks in terms of programming language, difficulty level, and etc.
About the Role: We are looking for experienced software engineers (tech lead level) who are familiar with high-quality public GitHub repositories and can contribute to this project. This role involves hands-on software engineering work, including development environment automation, issue triaging, and evaluating test coverage and quality
What does day-to-day look like:
Analyze and triage GitHub issues across trending open-source libraries.
Set up and configure code repositories, including Dockerization and environment setup.
Evaluating unit test coverage and quality.
Modify and run codebases locally to assess LLM performance in bug-fixing scenarios.
Collaborate with researchers to design and identify repositories and issues that are challenging for LLMs.
Opportunities to lead a team of junior engineers to collaborate on projects.
Required Skills:
Minimum 5+ years of overall experience
Strong experience with at least one of the following languages: Ruby
Proficiency with Git, Docker, and basic software pipeline setup.
Ability to understand and navigate complex codebases.
Comfortable running, modifying, and testing real-world projects locally.
Experience contributing to or evaluating open-source projects is a plus.
Nice to Have:
Previous participation in LLM research or evaluation projects.
Experience building or testing developer tools or automation agents.
Mandatory Skills
3 - 4+ years of relevant software development experience.
Ruby - min 3+ yrs exp
This is a short term remote contract opportunity that requires working at least 20 to 40 hours (part time/ full time) in week the US Pacific Time Zone. If this role suits you, please email your resume to [email protected]. kindly mention your current CTC, expected CTC and Notice Period in your email