Software Engineer - Data

Voltai
Palo Alto, CA, United States
$150K-$350K a year
Full-time
We are sorry. The job offer you are looking for is no longer available.

About Voltai

Voltai’s mission is to re-build the physical world through developing super-intelligence to accelerate the pace of hardware innovation.

Our focus is to build frontier models that could understand one of the world’s most complex technologies semiconductor and electronics.

About the Role

We’re looking for a Data Engineer who thrives in tackling complex challenges and is passionate about building innovative data systems.

As a Data Engineer at Voltai, your primary responsibility will be to build the world’s largest semiconductor dataset, leveraging your expertise in data pipelines and scalable infrastructure.

You will play a critical role in preparing data for our Machine Learning team and developing systems that manage vast amounts of information across various modalities such as text, images, circuits, and more.

Key Responsibilities :

  • Build and manage the world’s largest semiconductor dataset.
  • Develop crawlers to scrape data at an internet scale.
  • Extract and clean information from diverse modalities, including text, images, circuits, simulations, and signals.
  • Prepare and preprocess data for the Machine Learning team.
  • Build systems to handle the transfer of customer data and feedback.
  • Parse documents across various formats and structures.
  • Develop data pipelines for data labelers and manage workloads across large cloud compute clusters.
  • Implement and maintain systems for pre-processing datasets for AI training.

Required Skillsets :

  • Proven experience in building scalable data pipelines.
  • Expertise in PDF parsing and data extraction.
  • Strong engineering skills with a passion for improving data and model performance.
  • Experience working with modalities beyond text and demonstrating exceptional work in those areas.
  • Ability to build custom data processing libraries from scratch.
  • Keeping up with state-of-the-art techniques for preparing AI training data.
  • Proficiency in organizing and meticulously managing data across multiple clouds, modalities, and sources.

Bonus Points :

  • Background in Electrical Engineering.
  • Experience in connecting machine learning model behavior to data distribution and data quality.
  • Experience in fine-tuning large language models.
  • Experience at a hyper-growth startup.
  • Experience building data pipelines for training foundation models.

Compensation Philosophy

At Voltai, we believe that exceptional work deserves exceptional rewards. Our compensation structure reflects the value each team member brings to our pioneering efforts in the semiconductor and AI industries.

For this role, we anticipate the starting annual base salary to be within the range of $150,000 to $350,000, adjusted according to the candidate's experience, expertise, and impact potential.

The final offer may vary to ensure alignment with individual contributions and the long-term success of our mission.

Our Benefits

At Voltai, we believe in taking care of our team so they can focus on pushing the boundaries of innovation. Our benefits package is designed to support your well-being and fuel your professional growth.

  • Unlimited PTO : We trust you to manage your time and know when you need a break. Recharge when you need it, no questions asked.
  • Comprehensive Health Coverage : Your health matters. We offer top-tier medical and dental insurance to keep you and your loved ones covered.
  • Commitment to Your Growth : At Voltai, we’re dedicated to your continuous learning and development. Whether it’s through challenging projects or opportunities for professional advancement, we invest in your journey to becoming a leader in your field.
  • 25 days ago
Related jobs
Promoted
Apple
Sunnyvale, California

Would you like to work in an energizing environment where your abilities will be challenged on a day-to-day basis? If so, Apple's IS&T Ai & Data Platforms team is looking for highly motivated, detail oriented, technical savvy, results-oriented professionals who like to think creatively and want to b...

Aurora
Mountain View, California

Software Engineer - Autonomy Data: Continuous Learning. Collaborate with autonomy engineers to improve the quality and composition of our datasets. Optimize data pipelines handling sensor data from millions of miles of on-road. Support feature development and for our labeling applications and data a...

Snowflake
San Mateo, California

AS A SENIOR SOFTWARE ENGINEER - DATA PRIVACY, YOU WILL:. AS A SENIOR SOFTWARE ENGINEER - DATA PRIVACY, YOU HAVE:. The Data Privacy team is responsible for integrating privacy-enhancing technologies (PETs) like Differential Privacy, Anonymization, and Synthetic Data Generation into Snowflake. We enab...

Abbott
Milpitas, California

Our Diabetes division currently has an opportunity for a Senior Software Test Engineer. Perform exploratory testing, system level end to end testing, develop test datasets and execute automation scripts (to ensure application software releases are of high quality). Specifically verify data visualiza...

BHO Tech
San Mateo, California

As a data engineer, you’ll build the systems responsible for processing massive behavioral datasets and serving analytics queries in sub-second time. Work with a data scientist to implement a novel data correction in production. Build, from scratch, a processing pipeline for a new dataset. Run a bak...

Apple
Sunnyvale, California

As a Framework Software Engineer, you will be responsible for building various tools and features for Data and ML platforms, including data processing, insights portal, data observability, data lineage, model hub, and data visualization. Would you like to work in a fast-paced environment where your ...

Second Measure
San Mateo, California

As a senior data engineer at Second Measure, you will be a critical technical contributor in the team responsible for building, managing data pipelines, and enriching transaction data. Full Time] Senior Software Engineer (Data) at Second Measure (United States). Senior Software Engineer (Data). We a...

Games Jobs Direct
San Mateo, California

As a Principal Data Engineer, you will work to define the data ontology for all of Roblox, establish standard methodologies for data operations and lifecycle management, design and build analytics tooling and frameworks, and influence event instrumentation. The Data Engineering team at Roblox plays ...

TikTok
San Jose, California

Minimum Qualifications:- Bachelor's Degree or above, majoring in Computer Science, or related fields, with 3+ years of experience building scalable systems;- Proficiency in common big data processing systems like Spark/Flink at the source code level is required, with a preference for experience in c...

Apple
Cupertino, California

As a core member of the Data Engineering team you will be responsible for designing and implementing features that rely on processing and serving very large datasets with an awareness of scalability. The team’s data-driven engineers focus relentlessly on the customer experience by running worldwide ...