14 jobs for Evaluation Engineer
Job listings: Evaluation Engineer jobs
Job found 2 days ago on Jooble
FERCHAU GmbH Niederlassung Augsburg
• Landkreis Landsberg am Lech; Region München, Bayern; Regierungsbezirk Oberbayern; Bayern
Completed degree
Flexible working hours, collective agreement
[...] would like to bring to [...] level. We deliver exciting projects for well-known clients in all technology fields and for all industries, and we impress every day with solid expertise and technical know-how. IoT Test and Evaluation Engineer (m/w/d) Your responsibilities: defining and running integration tests for IoT devices; coordinating with test departments to evaluate test results; supporting user requests about IoT components via Jira; managing and monitoring test devices and long-term tests, with documentation in Confluence; developing and maintaining test automation frameworks with pytest and C; simulating [...]
Job found 3 days ago on Jooble
FERCHAU GmbH
IoT Test and Evaluation Engineer (m/w/d)
• Landkreis Landsberg am Lech; Region München, Bayern; Regierungsbezirk Oberbayern; Bayern
Completed degree
Flexible working hours, collective agreement
Your responsibilities: defining and running integration tests for IoT devices; coordinating with test departments to evaluate test results; supporting user requests about IoT components via Jira; managing and monitoring test devices and long-term tests, with documentation in Confluence; developing and maintaining test autom [...]
Job found on Jooble on 25 April 2026
Mindrift
• Region Stuttgart, Württemberg; Regierungsbezirk Stuttgart; Württemberg Stuttgart, DE
Freelance
[...] employment. What this opportunity involves: We're building a dataset to evaluate AI coding agents (how well a model handles real-world developer tasks). You'll create challenging tasks and evaluation criteria within realistic simulated environments. Build virtual companies following a high-level plan: codebase, infrastructure, and context (conversations, documentation, tickets) that form a realistic environment with development history. Assemble and calibrate tasks from intermediate states of the virtual company: craft the prompt, define evaluation criteria, and ensure the task is [...]
Job found on Jooble on 25 April 2026
Mindrift
Freelance Agent Evaluation Engineer
• Region München, Bayern; Regierungsbezirk Oberbayern; Bayern München, DE
Freelance
[...] employment. What this opportunity involves: We're building a dataset to evaluate AI coding agents (how well a model handles real-world developer tasks). You'll create challenging tasks and evaluation criteria within realistic simulated environments. Build virtual companies following a high-level plan: codebase, infrastructure, and context (conversations, documentation, tickets) that form a realistic environment with development history. Assemble and calibrate tasks from intermediate states of the virtual company: craft the prompt, define evaluation criteria, and ensure the task is [...]
Job found on Jooble on 25 April 2026
Mindrift
Freelance Agent Evaluation Engineer
• Hamburg Hamburg, DE
Freelance
[...] employment. What this opportunity involves: We're building a dataset to evaluate AI coding agents (how well a model handles real-world developer tasks). You'll create challenging tasks and evaluation criteria within realistic simulated environments. Build virtual companies following a high-level plan: codebase, infrastructure, and context (conversations, documentation, tickets) that form a realistic environment with development history. Assemble and calibrate tasks from intermediate states of the virtual company: craft the prompt, define evaluation criteria, and ensure the task is [...]
Job found on Jooble on 25 April 2026
Mindrift
Freelance Agent Evaluation Engineer
• Berlin Berlin, DE
Freelance
[...] employment. What this opportunity involves: We're building a dataset to evaluate AI coding agents (how well a model handles real-world developer tasks). You'll create challenging tasks and evaluation criteria within realistic simulated environments. Build virtual companies following a high-level plan: codebase, infrastructure, and context (conversations, documentation, tickets) that form a realistic environment with development history. Assemble and calibrate tasks from intermediate states of the virtual company: craft the prompt, define evaluation criteria, and ensure the task is [...]
New: job found 4 hours ago on Jobleads
Freelance Agent Evaluation Engineer
• Hamburg
Freelance
[...] not permanent employment. What this opportunity involves: We're building a dataset to evaluate AI coding agents (how well a model handles real-world developer tasks). You'll create challenging tasks and evaluation criteria within realistic simulated environments. Build virtual companies following a high-level plan: codebase, infrastructure, and context (conversations, documentation, tickets) that form a realistic environment with development history. Assemble and calibrate tasks from intermediate states of the virtual company: craft the prompt, define evaluation criteria, and ensure the task is [...]
New: job found 4 hours ago on Jobleads
IoT Test and Evaluation Engineer (m/w/d)
• Kaufering, Bayern
Completed degree
Flexible working hours, collective agreement
Job details for: IoT Test and Evaluation Engineer (m/w/d). Location: Kaufering. Employment type: full-time. Job description: Connecting people and technologies, creating the perfect match for our clients, always finding the right experts for each challenge: that is our aim at FERCHAU, and that is why we are looking for you as an ambitious colleague who, like us, [...]
Job found 8 days ago on Jobleads
Freelance Agent Evaluation Engineer
• Berlin
Freelance
[...] not permanent employment. What this opportunity involves: We're building a dataset to evaluate AI coding agents (how well a model handles real-world developer tasks). You'll create challenging tasks and evaluation criteria within realistic simulated environments. Build virtual companies following a high-level plan: codebase, infrastructure, and context (conversations, documentation, tickets) that form a realistic environment with development history. Assemble and calibrate tasks from intermediate states of the virtual company: craft the prompt, define evaluation criteria, and ensure the task is [...]
Job found 8 days ago on Jobleads
Freelance Agent Evaluation Engineer
• München, Bayern
Freelance
[...] not permanent employment. What this opportunity involves: We're building a dataset to evaluate AI coding agents (how well a model handles real-world developer tasks). You'll create challenging tasks and evaluation criteria within realistic simulated environments. Build virtual companies following a high-level plan: codebase, infrastructure, and context (conversations, documentation, tickets) that form a realistic environment with development history. Assemble and calibrate tasks from intermediate states of the virtual company: craft the prompt, define evaluation criteria, and ensure the task is [...]
Job found on Jooble on 21 April 2026
Auxilius.ai
AI Engineer for LLMOps Evaluation (m/f/d)
• Region München, Bayern; Regierungsbezirk Oberbayern; Bayern München, DE
[...] market fit. We build cutting-edge AI solutions for Governance, Risk and Compliance (GRC) for enterprises around the world. Our customers are auditors, risk managers, and compliance teams, which means evaluation rigor, auditability, and EU AI Act readiness aren't afterthoughts for us. They're product requirements. Tasks: As our AI Engineer for LLMOps Evaluation, you'll own the LLMOps pipeline end-to-end and work directly alongside our founding team. You will: Own the LLMOps pipeline: Evaluate infrastructure, prompt optimization loop, and the production integration that turns experiments into reliable customer-facing features. Design evaluation strategy per output type: Decide when [...]
Job found 11 days ago on Jobleads
Senior AI Software Engineer - Model Evaluation (f/m/d)
• Heidelberg, Baden-Württemberg
[...] models. While we highly value in-person work, we offer flexibility to work from Berlin or elsewhere in Germany, with regular travel to onsite events. Your responsibilities: As an AI Software Engineer in Model Evaluation, you will help design, implement, and scale the systems that measure our models' performance at the cutting edge. You will work closely with researchers to create evaluation benchmarks, datasets, and environments that test model capabilities, safety, and reliability across tasks from multilingual understanding to mathematical reasoning and creativity. You will [...]
Job found on Jobleads on 24 April 2026
AI Engineer for LLMOps Evaluation (m/f/d)
• München, Bayern
Consulting work
[...] market fit. We build cutting-edge AI solutions for Governance, Risk and Compliance (GRC) for enterprises around the world. Our customers are auditors, risk managers, and compliance teams, which means evaluation rigor, auditability, and EU AI Act readiness aren't afterthoughts for us. They're product requirements. Tasks: As our AI Engineer for LLMOps Evaluation, you'll own the LLMOps pipeline end-to-end and work directly alongside our founding team. You will: Own the LLMOps pipeline: Evaluate infrastructure, prompt optimization loop, and the production integration that turns experiments into reliable customer-facing features. Design evaluation strategy per output type: Decide when to [...]
Job found on Jobleads on 2 March 2026
Senior AI Systems Engineer: Production-Grade LLM Evaluation
• Zürich
A leading fintech company in Switzerland seeks an AI Systems Engineer to enhance AI accuracy and deploy models at scale. The successful candidate will manage the prompt evaluation lifecycle and develop custom metrics to increase quality in document processing. A degree in Data Science or a related field is required, along with strong Python skills and experience with LLM evaluation techniques. Candidates must be capable of systematic analysis and have an understanding of prompt engineering principles. In-person [...]

Frequently asked questions
How much does an Evaluation Engineer earn per year?
An Evaluation Engineer earns between EUR 55,000 and EUR 85,000 per year.
How many open listings for Evaluation Engineer jobs does our job search have?
JobRobot currently lists 14 open postings for Evaluation Engineer jobs.
How many companies are looking for applicants for Evaluation Engineer jobs?
Currently, 5 companies are looking for applicants for Evaluation Engineer jobs.
Which companies are looking for applicants for Evaluation Engineer positions?
Companies currently looking for applicants for Evaluation Engineer positions include, for example:
- Mindrift (4 jobs)
- FERCHAU GmbH Niederlassung Augsburg (1 job)
- FERCHAU GmbH (1 job)
- Auxilius.ai (1 job)
In which German federal states are the most Evaluation Engineer jobs offered?
Most listings for Evaluation Engineer jobs are currently offered in Bayern (7 jobs), Baden-Württemberg (2 jobs), and Hamburg (2 jobs).
Which occupational field do Evaluation Engineer jobs belong to?
Evaluation Engineer jobs belong to the occupational field Technology & Engineering.