We are looking for a hands-on Junior AI Developer to join our team in building next-generation AI solutions. This role goes beyond running models on Google Colab: you will be directly involved in the end-to-end development lifecycle, from frontend interface development and backend logic implementation through model optimization (RAG/inference) to deployment packaging (Docker/Linux).
KEY RESPONSIBILITIES:
- AI Implementation: Deploy and optimize open-source LLMs (such as Llama 3, Qwen, and Mistral) using Ollama or vLLM on on-premise infrastructure.
- System Development: Build RAG (Retrieval-Augmented Generation) systems integrated with vector databases such as ChromaDB, Milvus, or Qdrant.
- Fullstack Lite: Develop user interfaces (Web UI) using HTML5, CSS3, JavaScript (or React/Streamlit) and integrate them with Python APIs built on FastAPI/Flask.
- Deployment: Package applications into Docker containers, and manage and operate systems in Linux/Ubuntu Server environments.
- Open-source Contribution: Research, apply, and customize the latest open-source tools and frameworks from the global AI community.