Systems & Infrastructure Specialist for AI Model Training, Remote
Remote, USA
Last edited: yesterday
Description
In this role, you will leverage your expertise to enhance the training of next-generation AI systems. Your contributions will directly influence how models learn, reason, and perform by providing high-quality, real-world input. No prior experience in AI is necessary; your domain knowledge is the key asset.

Key Responsibilities:
- Navigate, troubleshoot, and recover dynamic infrastructure and long-running processes in real time using command-line tools.
- Master and manage highly containerized environments, including orchestrating Dockerized sandboxes and CI/CD workflows.
- Build, maintain, and optimize systems for AI model training and high-throughput compute environments.
- Respond swiftly to system errors, executing dynamic mid-operation replanning and recovery.
- Collaborate with engineering and AI teams to ensure seamless integration, reliability, and performance.
- Document system architectures, incident responses, and recovery protocols with meticulous clarity.
- Contribute expertise to evolving project needs, adapting to new technologies and scaling strategies as required.

Qualifications:
- Demonstrated expert proficiency working in terminal environments for system builds, server administration, and infrastructure management.
- Advanced problem-solving skills for multi-step troubleshooting, filesystem navigation, and process management within containerized settings.
- Hands-on experience with Python, Bash, JavaScript/TypeScript, Go, Rust, and/or C/C++.
- Deep familiarity with build systems, package managers, databases, web servers, ML frameworks, version control, and cryptography tools.
- Proven ability to execute dynamic infrastructure recovery and optimize long-running processes under pressure.
- Strong written and verbal communication skills, with a passion for precise technical documentation.
- Systems multilingualism: versatility across operating systems, languages, and emerging DevOps tools.

Preferred Qualifications:
- Prior experience in high-compute environments for AI/ML workloads.
- Background in Site Reliability Engineering or DevOps roles focused on mission-critical infrastructure.
- Familiarity with advanced container orchestration and distributed system design.

Work Terms: Contract position with remote work flexibility.
Compensation: $40 - $70 per hour.
Eligibility: Open to candidates with relevant expertise; no prior AI experience required.
Highlights
Job position: Systems & Infrastructure Specialist for AI Model Training
More details
This is a contract job.
More info about this ad
Systems & Infrastructure Specialist for AI Model Training has been posted in the Escondido Education & Training category on Locanto.