S
source · wttj·req · jb_ccfbc92dd0·listed 3d ago
Software Engineer (Data Infrastructure & Acquisition)
Speechify·Bristol, United Kingdom·Hybrid·Full-time
Sourced listing · wttjNo salary disclosed
compensation · not disclosed
Salary not shared
Sign up to see our estimate based on role, location, and seniority.
source · estimate pending
Summary
the pitchJoin Speechify, a leading AI company, as a Software Engineer focused on data collection for model training operations. You will be responsible for finding new sources of audio data, operating and extending our cloud infrastructure, and collaborating with scientists to improve data quality and scale. The ideal candidate has a BS/MS/PhD in Computer Science or a related field, 5+ years of industry experience in software development, and proficiency in Docker, Python, and Linux environments.
Role
posted by company- Ability to handle multiple tasks and adapt to changing priorities
- BS/MS/PhD in Computer Science or a related field
- Proficiency in Docker and Infrastructure-as-Code concepts and professional experience with at least one major Cloud Provider (we use GCP)
- Strong communication skills, both written and verbal
- 5+ years of industry experience in software development
- Proficiency with bash/Python scripting in Linux environments
- Experience with web crawlers, large-scale data processing workflows is a plus
Key responsibilities
- Collect and integrate new sources of audio data into the ingestion pipeline.
- Operate and extend the cloud infrastructure for the ingestion pipeline, currently running on GCP.
- Collaborate with scientists and the AI team to enhance data quality and scale for model training.