Recordly.AI
Speak. Transcribe. Illuminate.
Meet Recordly.AI. Experience the award-winning innovation! Recordly.AI, the world's first Unified Audio & Video Intelligence Platform.
Join Recordly.ai Pioneering the Future of Speech and Audio AI
At Recordly.ai, we are at the forefront of building industry-leading Speech and Audio foundation models that power our cutting-edge transcription, captioning, and translation services.
As an AI Engineer specializing in Speech and Audio, you will have the unique opportunity to advance state-of-the-art research, develop foundational models, and integrate them into products that transform the way people and businesses interact with AI.
Our Mission :
Recordly.ai's mission is to create the most advanced Speech and Audio foundation models that enable unparalleled transcription accuracy and natural-sounding speech synthesis.
Our current focus is on enhancing speech recognition and synthesis to deliver human-like voices and nuanced transcriptions, making AI interactions more intuitive and impactful.
Where We Stand :
We have already built and fine-tuned an accurate speech model capable of recognizing speech with 98% accuracy & another model for generating high-quality voice outputs from 10 seconds of input.
We support advanced deep voice cloning using 30-60 minutes of voice data, setting new standards in AI-driven speech synthesis and transcription.
Job Duties
As a Senior Fullstack and AI Engineer at Recordly, you will integrate advanced AI technologies into innovative web and mobile applications while spearheading the training and fine-tuning of cutting-edge AI models for audio and video data processing.
Key responsibilities include :
- Developing and integrating AI models for tasks such as speech recognition, emotion detection, video summarization, and content generation , focusing on achieving high performance, scalability, and real-time latency requirements.
- Designing and implementing machine learning algorithms and training state-of-the-art Turkish and English speech recognition models on large datasets, followed by rigorous evaluation of their performance.
- Creating deep learning models that optimize audio and video processing pipelines , significantly enhancing application responsiveness and overall user experience.
- Conducting original research to address unsolved real-world problems in speech recognition and advancing the state-of-the-art for use cases involving multiple languages , including Turkish and English.
- Developing machine learning models for both server-based and embedded systems, ensuring they meet stringent performance and efficiency standards.
- Leading the deployment of AI models in production environments, ensuring they meet real-time latency, robustness, and efficiency requirements.
- Researching and implementing advanced techniques in transfer learning, multimodal learning, and domain adaptation to improve model accuracy across varied datasets, particularly for low-resource scenarios.
- Creating and optimizing data augmentation and preprocessing strategies to improve training data quality, ensuring robustness even in low-resource scenarios.
- Selecting and optimizing neural network architectures (e.g., CNNs, RNNs, Transformers) to meet specific project objectives, with a focus on audio and video AI applications.
- Collaborating with cross-functional teams to seamlessly integrate AI models into production systems, utilizing frameworks such as TensorFlow and PyTorch.
- Building and scaling web applications using React.js, Node.js, and Nest.js , and mobile applications with React Native, incorporating AI-driven features to enhance user engagement.
- Managing end-to-end service deployment and optimization using Docker, Kubernetes, and CI / CD tools to ensure reliable operation across cloud platforms like AWS and GCP.
- Mentoring junior team members in best practices for AI model development, deployment, and full-stack development, fostering a culture of continuous learning and innovation.
- Utilizing expertise in creating mobile applications that offer personalized user experiences, enhancing AI-based features to boost user engagement on digital platforms.
- Implementing advanced security protocols to safeguard AI models and data against adversarial attacks and ensure compliance with international data privacy regulations.
- Leading cross-disciplinary research collaborations to innovate and refine AI methodologies, particularly in natural language processing (NLP) and computer vision, to expand the applications of AI models in emerging fields.
Education Required
Master’s degree or higher in Computer Science, Machine Learning, Electrical Engineering, or a closely related field.
Training Required
Specialized Training : Formal training in advanced machine learning, with a focus on deep learning for audio and video processing.
This training should include hands-on experience with AI frameworks such as TensorFlow, PyTorch.
Experience Required
- 5 years of progressive experience in AI engineering roles, with a specialization in training and fine-tuning machine learning models for audio and video data.
- Proven track record in deploying scalable, high-performance AI models in production environments, meeting real-time processing requirements.
- Demonstrated experience in leading Speech AI projects that have been successfully integrated into commercial products or enterprise-level systems.
Special Requirements
- Strong proficiency in neural network architectures (CNNs, RNNs, Transformers) and their application in audio and video AI tasks.
- Expertise in programming languages including Python, Java, JavaScript, TypeScript, C#, SQL, and frameworks such as React, React Native, Spring Boot, Node.
js, Nest.js, NoSQL, and PostgreSQL.
- Extensive experience with cloud platforms (AWS, GCP) and tools such as Docker, Kubernetes, Lens, CI / CD, Sonar, Grafana, New Relic, Kibana, OpenCV, and FFmpeg.
- Advanced knowledge in signal processing, audio feature extraction, and video encoding / decoding.
- Strong understanding of data privacy regulations and ethical AI considerations, especially concerning multimedia data.
- Proven ability to conduct independent research, publish findings in peer-reviewed journals, and present at industry conferences.
- Experience in optimizing AI models for embedded systems and IoT devices, ensuring performance in resource-constrained environments.
- Fluency in implementing AI-driven solutions that comply with ISO 27001 and GDPR standards.
Foreign Language Requirement
- Fluent / Native in Turkish
- Fluent / Native in English