Key Responsibilities
· Manage, operate, and optimize CT Group’s hybrid infrastructure, including on-premises systems, FPT Cloud, and Azure Cloud.
· Administer and maintain Kubernetes (K8s) clusters, including Helm, Ingress, and service mesh (Istio, Linkerd).
· Design, implement, and maintain CI/CD pipelines, Infrastructure-as-Code (IaC), and automation solutions using GitLab CI, Jenkins, ArgoCD, Terraform, and Ansible.
· Operate and optimize ERP systems (Odoo, SAP, Dynamics…) and web applications (PHP, Python, Node.js, .NET), ensuring performance, scalability, and security.
· Design and maintain system architecture for high availability, scalability, disaster recovery, and hybrid/multi-cloud operations.
· Implement system observability: monitoring, logging, alerting, and tracing (Prometheus, Grafana, ELK, Loki, Alertmanager).
· Ensure infrastructure and application security: IAM, RBAC, TLS, PKI, WAF, secrets management, web and API security.
· Collaborate with Data, Infra, and Security teams to integrate infrastructure with data platforms and internal applications.
· Support DevOps, SRE, and FinOps practices to optimize system reliability and cloud operating costs.
· Troubleshoot incidents, perform root cause analysis, and execute performance tuning for mission-critical systems.