One of the most underrated features of generative artificial intelligence (AI) is the ability to upload and use images to solve problems, work more efficiently, and improve your work. ChatGPT and Google’s Gemini allow you to upload images in many different file formats. The benefits are untapped, and I’m going to show you a couple easy use cases to jumpstart your success with images.
We forgot using images because we’re focused on typing into AI solutions. And most of us type into software and computer programs. Yes, we’re always looking at “images” on our computer screens. AI is amazing at understanding images, especially on well documented tools like Microsoft Office. As we know, a picture is worth a thousand words. It’s why your IT team asks for screenshots to help solve problems.
Think about how often we use screenshots to communicate information clearly and efficiently. Now, imagine using that same visual power within your AI interactions. If you’re grappling with technical glitches, seeking knowledge on a specific topic, or simply trying to convey a complex idea, showing AI what you’re seeing can make all the difference.
It’s Easier Than You Think
Both ChatGPT and Gemini have streamlined the image upload process. In ChatGPT, you’ll find a handy paperclip icon, while Gemini offers a plus (+) symbol. A quick upload, a well-crafted prompt, and you’re ready to tap into a new level of AI assistance.
Use Case 1: Handwritten Notes, Transformed
We’ve all been there: a productive meeting filled with scribbled notes, followed by the daunting task of deciphering, and transcribing them later. AI can help you bridge the gap between analog and digital.
Simply snap a photo of your handwritten notes (your smartphone is perfect for this), upload it to your AI tool of choice, and ask it to summarize the key points. AI’s impressive handwriting recognition capabilities will do the heavy lifting, saving you time and effort.
A Few Tips for Optimal Results:
Once AI generates the summary, review it, make any necessary tweaks, and then effortlessly integrate the text into your preferred digital format—a Word document, an email, etc.
Here’s a video of ChatGPT taking a page of notes and turning it into a digital write up. Note how the writing has some structure, but the handwriting isn’t the most clear. Then watch how ChatGPT writes it out.
Use Case 2: Troubleshooting and Problem-Solving, Supercharged
From navigating basic or complex software interfaces to resolving unexpected technical hiccups, we all encounter challenges in our digital lives. The good news is that many of these issues have documented solutions online. AI can leverage this wealth of knowledge to become your personal tech support guru. Upload an image and provide some context, such as an error message, and let AI give you on demand assistance.
Optimize Your AI Troubleshooting:
Here’s a video of Gemini helping me troubleshoot a Microsoft Power Automate issue. I uploaded a video and am simply having a conversation to better understand what went wrong and how I could fix it.
A Picture Is Worth a Thousand Words
These are easy use cases you can try immediately. Testing AI’s limits with images and problem solving is a simply solution that can have very rewarding results. Start with AI!
It’s important to remember that your word choice matters when using AI. Consider these words.
New users often haphazardly use these words and don’t get the results they expect or want. While AI is intuitive, it’s also logical. Learning how AI interacts with your prompts, like learning how to work with a colleague, will help you improve your output and productivity.
By incorporating images into your interactions, you unlock a whole new level of communication and problem-solving potential. Don’t underestimate the power of showing AI what you see. It’s a simple yet incredibly effective way to harness the full capabilities of these remarkable tools.