We are seeking a Speech AI Engineer to design, build, and deploy intelligent speech systems that transcribe human speech into accurate text and detect emotional tone and intent from voice.
Responsibilities:
- Design and develop speech-to-text (ASR) models using modern architectures such as Whisper, Wav2Vec2, or Conformer.
- Build speech emotion recognition (SER) models to classify tone, emotion, and mood from voice inputs.
- Collect, preprocess, and annotate custom speech datasets — dataset creation is part of this role.
- Apply data augmentation and noise-robust training for better real-world performance.
- Implement quantization, pruning, and optimization of models for real-time inference on servers.
- Develop and expose REST APIs for model access using FastAPI, ensuring scalability and security (a minimal serving sketch follows this list).
- Integrate speech models with text-processing and chatbot systems for unified voice–text experiences.
- Manage the full ML lifecycle — training, validation, deployment, monitoring, and continuous improvement.
- Containerize and deploy models using Docker and orchestrate services via Kubernetes.
- Continuously explore state-of-the-art speech and multimodal AI research to improve accuracy and reduce latency.
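To make the serving responsibility concrete, here is a minimal, hypothetical sketch of a FastAPI endpoint wrapping a Hugging Face Whisper ASR pipeline. The checkpoint name, route, and absence of auth or batching are illustrative assumptions, not the team's actual stack; the Transformers pipeline decodes the uploaded bytes via ffmpeg.

```python
# Hypothetical sketch only: a minimal FastAPI transcription endpoint wrapping a
# Hugging Face Whisper pipeline. Checkpoint, route, and response schema are
# placeholders, not a prescribed production design.
from fastapi import FastAPI, File, UploadFile
from transformers import pipeline

app = FastAPI(title="ASR demo")

# Load the model once at startup; "openai/whisper-small" is an illustrative checkpoint.
asr = pipeline("automatic-speech-recognition", model="openai/whisper-small")

@app.post("/transcribe")
async def transcribe(audio: UploadFile = File(...)) -> dict:
    # The pipeline accepts raw audio bytes and handles decoding/resampling internally
    # (ffmpeg must be available on the host).
    audio_bytes = await audio.read()
    result = asr(audio_bytes)
    return {"filename": audio.filename, "text": result["text"]}
```

A sketch like this would typically be launched with `uvicorn main:app` (assuming the file is `main.py`) and would still need authentication, request limits, and batching before production use.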
Required Skills & Experience:
- Hands-on experience in speech recognition, speech emotion detection, or speaker identification.
- Proficiency in Python and deep learning frameworks (PyTorch, TensorFlow, Hugging Face Transformers).
- Strong understanding of audio feature extraction (MFCC, log-mel spectrograms) and sequence-modeling objectives such as CTC loss (see the feature-extraction sketch after this list).
- Experience with model quantization and optimization (ONNX, TensorRT, TorchScript); an export example follows the list.
- Proficiency with FastAPI, Docker, and Kubernetes for scalable deployment.
- Familiarity with CI/CD, MLOps, and model monitoring workflows.
- Knowledge of GPU acceleration and multi-model inference management.
- Familiarity with databases (MongoDB, PostgreSQL, Redis).
- Practical experience with real-time or streaming audio pipelines is a strong plus.
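For illustration only, a short sketch of log-mel feature extraction with torchaudio. The 16 kHz sample rate, 400-point FFT, 160-sample hop, and 80 mel bins are common Whisper-style defaults assumed here, not requirements of the role.

```python
# Illustrative log-mel spectrogram extraction with torchaudio; parameter values
# are assumptions chosen to mirror common ASR front-ends.
import torch
import torchaudio

def log_mel_features(path: str, target_sr: int = 16000) -> torch.Tensor:
    waveform, sr = torchaudio.load(path)          # (channels, samples)
    if sr != target_sr:
        waveform = torchaudio.functional.resample(waveform, sr, target_sr)
    mel = torchaudio.transforms.MelSpectrogram(
        sample_rate=target_sr,
        n_fft=400,
        hop_length=160,
        n_mels=80,
    )(waveform.mean(dim=0))                       # mono mix-down -> (n_mels, frames)
    return torch.log(mel + 1e-6)                  # log compression for numerical stability
```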
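Likewise, a hedged example of the kind of optimization work mentioned above: exporting a Wav2Vec2 CTC model to ONNX and applying dynamic INT8 quantization with onnxruntime. The checkpoint, file paths, and opset version are placeholders.

```python
# Assumed workflow sketch: ONNX export plus dynamic quantization of a CTC model.
# Checkpoint and output paths are illustrative only.
import torch
from transformers import Wav2Vec2ForCTC
from onnxruntime.quantization import quantize_dynamic, QuantType

model = Wav2Vec2ForCTC.from_pretrained(
    "facebook/wav2vec2-base-960h", return_dict=False
).eval()
dummy = torch.randn(1, 16000)  # one second of 16 kHz audio as a tracing input

torch.onnx.export(
    model,
    dummy,
    "wav2vec2.onnx",
    input_names=["input_values"],
    output_names=["logits"],
    dynamic_axes={"input_values": {0: "batch", 1: "samples"}},
    opset_version=14,
)

# Dynamic quantization converts weights to INT8 for faster CPU inference.
quantize_dynamic("wav2vec2.onnx", "wav2vec2.int8.onnx", weight_type=QuantType.QInt8)
```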