For my final project, I am developing Drawing AI App, a Python desktop application that combines interactive drawing, AI-based drawing recognition, and spoken feedback in a single user experience. The project demonstrates how creative coding and AI integration can work together to produce engaging, accessible tools for users of all skill levels.
Project Overview
Drawing AI App offers users a responsive digital canvas where they can sketch freely using a variety of drawing tools. Once a drawing is complete, an AI assistant attempts to recognize the object or shape, providing spoken and visual feedback. The application aims to create an experience that feels intuitive, light-hearted, and educational, balancing technical sophistication with user-centered design.
Core Features
Drawing Canvas
The application features a PyQt5-based drawing canvas with adjustable pen sizes, a color palette, an eraser, and a clear-canvas button. The interface is designed to be responsive and easy to use, giving users a clean, modern environment for creative expression.
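One way to keep the canvas logic easy to test is to hold the drawing state in a small model that does not depend on the GUI toolkit. The sketch below is only an illustration under that assumption: the names `Stroke` and `CanvasModel`, the default colors, and the choice to model the eraser as a pen painting in the background color are all hypothetical and not taken from the actual project.

```python
from dataclasses import dataclass, field
from typing import List, Tuple

Point = Tuple[int, int]


@dataclass
class Stroke:
    """One continuous pen movement (hypothetical structure)."""
    color: str
    width: int
    points: List[Point] = field(default_factory=list)


@dataclass
class CanvasModel:
    """Toolkit-independent drawing state; all names here are assumptions."""
    background: str = "#ffffff"
    pen_color: str = "#000000"
    pen_width: int = 4
    eraser_on: bool = False
    strokes: List[Stroke] = field(default_factory=list)

    def begin_stroke(self, point: Point) -> None:
        # Assumption: the eraser is a pen that paints in the background color.
        color = self.background if self.eraser_on else self.pen_color
        self.strokes.append(Stroke(color, self.pen_width, [point]))

    def extend_stroke(self, point: Point) -> None:
        # Ignore movement events that arrive before any stroke was started.
        if self.strokes:
            self.strokes[-1].points.append(point)

    def clear(self) -> None:
        # Backs the clear-canvas button: drop every stroke.
        self.strokes.clear()
```

In a PyQt5 widget, `mousePressEvent` and `mouseMoveEvent` could call `begin_stroke` and `extend_stroke`, and `paintEvent` could redraw each stroke with a `QPainter`. Keeping the state separate from the widget means pen size, eraser, and clear behavior can be checked without opening a window.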
AI Drawing Recognition
Using a vision-capable OpenAI model through the OpenAI API, the AI assistant analyzes the user’s drawing and displays its top three guesses, each with a confidence percentage. If the AI’s confidence is low, it prompts the user for additional clarification.
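The ranking and low-confidence logic described above could look roughly like the following. This sketch does not call the OpenAI API; it assumes the recognition step has already produced a list of `(label, confidence)` pairs with confidence between 0 and 1. The function names and the 0.5 threshold are illustrative assumptions, not values from the project.

```python
from typing import List, Tuple

Prediction = Tuple[str, float]  # (label, confidence in the range 0..1)

# Assumed cut-off below which the assistant asks for clarification.
LOW_CONFIDENCE = 0.5


def top_guesses(predictions: List[Prediction], k: int = 3) -> List[Prediction]:
    """Return the k highest-confidence predictions, best first."""
    ranked = sorted(predictions, key=lambda p: p[1], reverse=True)
    return ranked[:k]


def format_guess(label: str, confidence: float) -> str:
    """Render one guess with a whole-number percentage for display."""
    return f"{label}: {confidence:.0%}"


def needs_clarification(predictions: List[Prediction],
                        threshold: float = LOW_CONFIDENCE) -> bool:
    """True when there is no guess or the best guess is below the threshold."""
    best = top_guesses(predictions, k=1)
    return not best or best[0][1] < threshold
```

For example, given guesses for "cat", "dog", "fox", and "cow", `top_guesses` keeps the three strongest, `format_guess` produces the text shown next to each one, and `needs_clarification` decides whether the assistant should ask the user what they drew.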
Voice Communication
The AI assistant speaks its feedback using text-to-speech. The initial implementation uses pyttsx3, which works offline; I plan to integrate OpenAI’s TTS API later for more natural-sounding speech. Voice feedback makes the experience more interactive and immersive.
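A minimal sketch of the offline voice path with pyttsx3 might look like this. The helper names `feedback_sentence` and `speak`, the wording of the sentence, and the speaking rate are assumptions made for illustration. pyttsx3 is a third-party package and is imported inside `speak` so that the sentence-building logic can be used and tested without it.

```python
def feedback_sentence(label: str, confidence: float) -> str:
    """Build the sentence the assistant would say (wording is an assumption)."""
    return f"I think this is a {label}. I am {confidence:.0%} sure."


def speak(text: str, rate: int = 160) -> None:
    """Speak text aloud with pyttsx3; the rate value is an assumed default."""
    import pyttsx3  # third-party; imported lazily so it is only needed here

    engine = pyttsx3.init()
    engine.setProperty("rate", rate)  # speaking rate in words per minute
    engine.say(text)
    engine.runAndWait()  # blocks until the queued speech has finished
```

Because `runAndWait` blocks until speech ends, calling `speak` directly from a button handler would freeze the interface while the assistant talks; running it in a worker thread is one possible way around that.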
Educational Component
An integrated information section explains how AI recognition operates, encouraging users to engage with the underlying technology. Users can correct the AI’s guesses, offering a learning opportunity and highlighting the limitations and strengths of machine learning models.
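User corrections could be recorded in a small log so the information section can show how often the AI’s first guess was right. The sketch below is one hypothetical way to do that; the `CorrectionLog` class and its case-insensitive comparison are assumptions, not part of the project as described.

```python
from dataclasses import dataclass, field
from typing import List, Tuple


@dataclass
class CorrectionLog:
    """Pairs of (AI guess, label confirmed by the user); hypothetical helper."""
    entries: List[Tuple[str, str]] = field(default_factory=list)

    def record(self, guessed: str, actual: str) -> None:
        # Store what the AI said alongside what the user says they drew.
        self.entries.append((guessed, actual))

    def accuracy(self) -> float:
        """Fraction of recorded guesses that matched, ignoring letter case."""
        if not self.entries:
            return 0.0
        correct = sum(1 for guessed, actual in self.entries
                      if guessed.lower() == actual.lower())
        return correct / len(self.entries)
```

Showing this running accuracy next to the explanation of how recognition works would give users a concrete view of where the model succeeds and where it fails.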