About the Project
Challenges & Solutions
Optimizing AI Recognition Accuracy via Multimodal Analysis
Initially, basic Vision APIs were used for label recognition. However, accuracy suffered in complex environments or at specific angles. Furthermore, the system could not generate educational-grade translations or spelling suggestions tailored for children.
Migrated to Google Gemini Ai and implemented advanced Prompt Engineering. By providing specific context instructions, we guided the AI to perform deep image analysis, generating not just object names, but also bilingual vocabulary, contextual sentences, and spelling breakdowns, significantly enhancing the educational value and accuracy.
Implementing Automated CI/CD Workflows for Enhanced Development Efficiency
In the early stages of development, manual deployment was time-consuming and prone to human error. It was also challenging to ensure consistency between development and production environments, which limited the pace of iteration.
Integrated GitHub with Vercel to establish an automated deployment pipeline. Every time code is pushed to the main branch, the system automatically triggers build and deployment tasks, achieving "code-to-live" seamlessness. This ensures deployment reliability and allows me to focus on feature development rather than environment configuration.