CÔNG TY TNHH GALAXY DIGITAL HOLDINGS
Galaxy Innovation Hub - D1 Road, Hi-Tech Park, District 9, HCMC.
Posted date: 04-11-2025
Experience
3 - 5 Years
Job level
Experienced (Non - Manager)
Salary
Competitive
Model Development & Optimization
· Maintain and enhance existing AI models for OCR on Vietnamese ID cards (CCCD) and extend to other document types (passports, driver licenses, bank documents).
· Fine-tune and adapt state-of-the-art OCR/document models (Donut) for production use.
· Optimize training and inference pipelines for performance, scalability, and cost efficiency.
Data Pipeline & Quality Management
· Manage large datasets combining synthetic and real-world document images.
· Build preprocessing and augmentation pipelines: image quality checks, blur/rotation detection, Vietnamese text normalization, PII masking.
· Ensure data quality and evaluation consistency across multiple document types.
Accuracy & Performance Evaluation
· Define and monitor evaluation metrics: character/word accuracy, exact match rate, edit distance, latency.
· Analyze failed predictions (e.g., accents, truncated fields, misrecognized entities) and integrate findings into retraining cycles.
· Implement image/document quality control to prevent poor inputs from degrading OCR accuracy.
Production & Monitoring
· Deploy, monitor, and maintain OCR models serving production workloads (100k+ documents/month).
· Investigate and resolve production failures, manage rollbacks, and improve system robustness.
· Collaborate with backend engineers to integrate OCR APIs with downstream systems.
Collaboration & Leadership
· Mentor junior engineers in computer vision and OCR best practices.
· Contribute to the long-term roadmap for Document AI, beyond ID cards, to support broader fintech/eKYC and document processing needs.
· Document experiments, model updates, and operational practices.
Must-have
· 3+ years of AI/ML engineering experience with Python and PyTorch.
· Practical experience in OCR or Computer Vision (e.g., image preprocessing, OpenCV).
· Experience with Vietnamese text processing (accents, tokenization, normalization).
· Familiarity with deep learning model training and fine-tuning, preferably with HuggingFace Transformers or OCR frameworks (PaddleOCR, Tesseract).
· Experience deploying ML models into production environments.
· Experience scaling machine learning services for high traffic.
· Knowledge of Linux, Docker, and Git.
Nice-to-have
· Knowledge of MLOps tools (Weights & Biases, MLflow, DVC).
· Model optimization skills: quantization, distillation, ONNX/TensorRT.
· Background in fintech/eKYC or handling sensitive/PII data.
Soft Skills
· Strong ownership mindset: accountable for the full lifecycle of OCR models.
· Problem-solving ability: capable of debugging training and inference issues.
· Communication skills: explain ML concepts and findings to technical and non-technical stakeholders.
· Collaborative attitude: work closely with backend, product, and QA teams.
Tech Stack
· Python, PyTorch, HuggingFace Transformers, PaddleOCR
· OpenCV, PIL
· Docker, Linux
· Git, DVC (optional)
· MLflow / Weights & Biases (nice-to-have)
Quy mô: 100-499 nhân viên
Lĩnh vực: Phần Mềm CNTT/Dịch vụ Phần mềm
Địa chỉ: Galaxy Innovation Hub, Road D1, Hi-Tech Park, District 9, HCMC.
Galaxy Holdings, a tech sector leader, is dedicated to shaping the future of the Digital Age. We offer a comprehensive benefits package, healthcare, Galaxy Elite perks, and vibrant cultural activities. As part of Sovico Group, we provide cutting-edge technology products and services to enhance lives and drive digital innovation. We deliver reliable solutions for corporate partners and personalized digital experiences for consumers. Committed to societal growth, we connect nations through technology, support our staff with integrity, and strive to build a prosperous Vietnam in the digital era. Our core values include being Human-Centric, Data-Driven, Transparent, Agile, and Ownership.
Our subsidiary company:
Galaxy One - https://galaxy.one/
Galaxy Joy - https://skyjoy.vietjetair.com/galaxyjoy/
Galaxy Telecom - https://skyfi.vn/buy-esim/nations/
Galaxy Connect - https://galaxyholdings.co/vi/subsidiaries/galaxy-connect/
Galaxy Pay - https://galaxyholdings.co/vi/subsidiaries/galaxy-pay/
Galaxyt Technology Services - https://www.galaxytechnology.vn/
FinOS - https://www.linkedin.com/company/finosvietnam/?originalSubdomain=vn
* Working place: Galaxy Innovation Hub, Road D1, Hi-Tech Park, District 9, HCMC.
* Working time: 44 hours/week from Monday to Friday
Galaxy Innovation Hub - D1 Road, Hi-Tech Park, District 9, HCMC.
https://galaxyholdings.co/
Company size: 100-499
Contact person: Phòng Tuyển dụng Nhân tài