Senior Robotics Data Engineer
Description
About VinDynamics
At VinDynamics, we design safe, affordable, and intelligent humanoid robots to assist in everyday life — robots for everyone. Backed by VinGroup, Vietnam’s leading technology conglomerate, we are on a mission to make advanced robotics accessible, reliable, and beneficial for billions of people worldwide. By combining cutting-edge AI, world-class engineering, and human-centered design, we aim to seamlessly integrate robots into daily life — enhancing safety, productivity, and happiness at home and beyond.
Overview
- Department: Data Department
- Company: VinDynamics
- Location: Reno, Nevada
- Reports to: Head of Data
VinDynamics is building general-purpose humanoid robots that learn from massive amounts of data collected from simulation, teleoperation, and real-world operation. We are seeking a Senior Robotics Data Engineer to design, build, and maintain robust, scalable data pipelines that transform raw robot experience into high-quality datasets powering perception, manipulation, autonomy, and foundation models. This role focuses on engineering excellence across data ingestion, storage, validation, versioning, and retrieval, ensuring that AI and robotics teams can train, evaluate, and iterate on models efficiently and reliably.
Key Responsibilities
- Design and implement end-to-end data pipelines for humanoid robots, covering:
  - Sensor data (RGB, depth, IMU, tactile, force/torque)
  - Robot states, actions, rewards, and task metadata
  - Language instructions and annotations for vision-language-action (VLA) models
- Build scalable systems for data ingestion, storage, indexing, and retrieval (on-prem or hybrid).
- Ensure time synchronization, schema consistency, and data integrity across multi-modal streams.
- Develop data validation, cleaning, and quality-checking tools.
- Support dataset versioning, lineage tracking, and reproducibility for ML training and evaluation.
- Integrate data systems with ML pipelines, experiment tracking, and model training workflows.
- Optimize data access performance for large-scale training and simulation workloads.
- Collaborate closely with AI Manipulation, Autonomy, Perception, and Simulation teams to translate model needs into data solutions.
Requirements
Required Qualifications
- Bachelor’s or Master’s degree in Computer Science, Data Engineering, Robotics, or a related field.
- Strong experience building data pipelines and backend systems.
- Proficiency in Python; experience with data processing frameworks and databases.
- Solid understanding of distributed systems, storage, and data workflows.
- Experience working with large, multi-modal datasets.
- Strong engineering mindset with attention to reliability, scalability, and maintainability.
Preferred Qualifications
- Experience with robotics or embodied AI data.
- Familiarity with reinforcement learning, imitation learning, or multimodal datasets.
- Experience with real-time or time-synchronized data streams.
- Knowledge of ML infrastructure, MLOps, or experiment tracking tools.
- Experience supporting research-to-production AI systems.
