Senior Python Engineer - Functional Testing Project via Mindrift
Join Mindrift as part of a global network of experts supporting top tech firms with project-based AI development and testing. This fully remote freelance role focuses on building functional tests and Dockerized environments while leveraging advanced LLMs and modern Python tooling. Commitment is flexible at 20-30 hours per week, with task-based compensation of up to $40/hour* and the opportunity to contribute to high-impact AI initiatives.
Ideal candidates have 5+ years of Python engineering experience, advanced pytest and Docker skills, extensive Linux/Bash fluency, and the confidence to read and work with multiple coding languages using LLM aids like Roo Code and Claude Code. Proficiency in tools such as uv, pyproject.toml, Git Submodules, Dagger, and GitHub Codespaces is essential. English proficiency (B2 or higher) is required; experience with agent evaluation platforms and MCP CLI is a plus.
Engagement is via the Mindrift platform powered by Toloka AI. Rates depend on expertise, project phase, and assessment outcomes. Enjoy flexible scheduling, remote collaboration, and access to a supportive, innovative AI community.
Please submit your CV in English and indicate your level of English proficiency.
Mindrift connects specialists with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems. Participation is project-based, not permanent employment.
About the Role
This project is suited for a Senior Python developer with deep functional testing experience, strong Linux and Docker skills, the ability to read code across multiple languages with the support of LLMs (e.g., C, Rust, Go) and translate requirements for migration tasks, and confidence using tools like Roo Code or Claude Code to accelerate iterative development.
Key Responsibilities
- Create functional black box tests for large codebases in various source languages
- Create and manage Docker environments to ensure 100% reproducible builds and test execution across different platforms
- Monitor code coverage and configure automated scoring criteria to meet industry benchmark-level standards
- Leverage LLMs (Roo Code, Claude) to accelerate development cycles, automate repetitive tasks, and improve overall code quality
- 5+ years of experience as a Software Engineer (primarily Python)
- Deep experience with pytest (fixtures, session-scoped, timeouts) and designing black-box functional tests for CLI tools
- Expert-level Docker skills (reproducible Dockerfiles, user contexts, secure workspaces)
- Strong Linux & Bash scripting skills and comfort debugging inside containers
- Proficiency with modern Python tooling (uv, pyproject.toml, packaging)
- Ability to read and understand with LLM many coding languages (for example C, C++, Rust, or Go)
- Experience using LLMs (Claude Code, Roo Code, Cursor) to accelerate iterative development and test-case generation
- English language - B2 or higher
Requirements +
- Prior experience with agent evaluation platforms and MCP CLI
Tools and Technologies: Python (pytest, uv, Pillow), Docker, Bash, Git Submodules, C/C++/Rust/Go (reading), Dagger, GitHub Codespaces, LLMs (Claude Code, Roo Code, Cursor), coverage.py, gcov, kcov.
What we can offer
- Freelance project-based collaboration via the Mindrift platform (powered by Toloka AI)
- Fully remote and flexible participation — choose when and how much to contribute (20-30 hours per week)
- Task-based compensation, equivalent to up to $40/hour* depending on performance and volume
- Opportunity to contribute to innovative AI projects for leading tech companies
- Supportive global community
*Note: Rates vary based on expertise, skills assessment, location, project needs, and other factors. Higher rates may be offered to highly specialized experts. Lower rates may apply during onboarding or non-core project phases. Payment details are shared per project.
Similar Jobs





