Transform Your Work Day with Images
Ben • August 16, 2024

One of the most underrated features of generative artificial intelligence (AI) is the ability to upload and use images to solve problems, work more efficiently, and improve your work. ChatGPT and Google’s Gemini allow you to upload images in many different file formats. The benefits are largely untapped, and I’m going to show you a couple of easy use cases to jumpstart your success with images.

We forget about images because we’re focused on typing into AI tools, just as we type into most software and computer programs. Yet we’re always looking at “images” on our computer screens. AI is remarkably good at understanding images, especially screenshots of well-documented tools like Microsoft Office. As we know, a picture is worth a thousand words. It’s why your IT team asks for screenshots to help solve problems.

Think about how often we use screenshots to communicate information clearly and efficiently. Now, imagine using that same visual power within your AI interactions. Whether you’re grappling with technical glitches, seeking knowledge on a specific topic, or simply trying to convey a complex idea, showing AI what you’re seeing can make all the difference.

It’s Easier Than You Think

Both ChatGPT and Gemini have streamlined the image upload process. In ChatGPT, you’ll find a handy paperclip icon, while Gemini offers a plus (+) symbol. A quick upload, a well-crafted prompt, and you’re ready to tap into a new level of AI assistance. 

Use Case 1: Handwritten Notes, Transformed

We’ve all been there: a productive meeting filled with scribbled notes, followed by the daunting task of deciphering and transcribing them later. AI can help you bridge the gap between analog and digital.

Simply snap a photo of your handwritten notes (your smartphone is perfect for this), upload it to your AI tool of choice, and ask it to summarize the key points. AI’s impressive handwriting recognition capabilities will do the heavy lifting, saving you time and effort.

A Few Tips for Optimal Results:

  • Organization is Key – Avoid Word Salads: While AI is remarkably good at understanding even the messiest handwriting, it can struggle with notes scattered haphazardly across the page. Try to maintain some semblance of structure to aid the AI’s interpretation.
  • Keywords Highlight Importance – Provide Some Basic Structure: If certain takeaways, action items, or other crucial points stand out, consider labeling them on the page with phrases such as “key points,” “to-dos,” “action steps,” or “due date.” These labels help AI identify what to prioritize in the summary, and they give you key phrases to use in your prompt.
  • Embrace Flexibility – Notes are Notes: AI won’t always produce a flawless transcription, but remember, these are your notes. Some level of interpretation is perfectly acceptable. If you want AI to transcribe something exactly, you can ask it to. But it’s unlikely your notes require that level of precision, so don’t expect a word-for-word rewrite unless you request one.
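
If you like to reuse prompts, the tips above can be turned into a small template. This is only a sketch: the function name `build_notes_prompt` and the exact wording of the prompt are my own illustrative choices, not something ChatGPT or Gemini requires. You would paste the resulting text into the chat alongside your uploaded photo.

```python
def build_notes_prompt(labels, exact=False):
    """Build a prompt to send along with a photo of handwritten notes.

    labels: phrases you wrote on the page, e.g. "key points" or "due date".
    exact:  True asks for a word-for-word transcription instead of a summary.
    (Hypothetical helper; the wording is an example, not a required format.)
    """
    if exact:
        task = "Transcribe these handwritten notes exactly as written."
    else:
        task = "Summarize the key points from these handwritten notes."

    if not labels:
        return task

    # Quote each label so the AI can match it against the text on the page.
    quoted = ", ".join(f'"{label}"' for label in labels)
    return (
        f"{task} Pay special attention to anything I labeled {quoted}, "
        "and list those items separately."
    )


prompt = build_notes_prompt(["key points", "action steps", "due date"])
print(prompt)
```

Keeping the labels in the prompt identical to the labels on the page is the point here: it ties what you wrote by hand to what you ask for.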

Once AI generates the summary, review it, make any necessary tweaks, and then effortlessly integrate the text into your preferred digital format—a Word document, an email, etc.

Here’s a video of ChatGPT taking a page of notes and turning it into a digital write-up. Note how the writing has some structure, but the handwriting isn’t the clearest. Then watch how ChatGPT writes it out.

Use Case 2: Troubleshooting and Problem-Solving, Supercharged

From navigating basic or complex software interfaces to resolving unexpected technical hiccups, we all encounter challenges in our digital lives. The good news is that many of these issues have documented solutions online. AI can leverage this wealth of knowledge to become your personal tech support guru. Upload an image, provide some context such as the error message you received, and let AI give you on-demand assistance.

Optimize Your AI Troubleshooting:

  • File Formatting Matters: Upload your image as an image file. A JPEG or PNG will generally work very well, while a PDF that contains an image is less likely to give you great results. As with anything AI-related, try both options to see what works for your needs.
  • Start Simple: Phrases like “help me troubleshoot this” or “can you help me solve this problem” are effective conversation starters. You may not fully understand your problem, which can make it hard to give AI direction, so a simple opener is enough.
  • Embrace the Dialogue: For technical issues, AI might not immediately deliver the perfect solution. Be prepared to engage in a back-and-forth exchange, providing additional context or asking clarifying questions as needed. Think of it as collaborating with a knowledgeable colleague to unravel the problem.
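
For readers who script their work, here is a minimal sketch of the first two tips: check that a screenshot is a JPEG or PNG, then pair it with a simple starter prompt. The names `image_mime_type` and `prepare_upload`, and the dictionary they produce, are hypothetical and not tied to any ChatGPT or Gemini API. If you only use the chat window, the paperclip or plus button does all of this for you.

```python
import base64
from pathlib import Path

# File types the tips above recommend; a PDF is deliberately left out.
IMAGE_TYPES = {".jpg": "image/jpeg", ".jpeg": "image/jpeg", ".png": "image/png"}


def image_mime_type(path):
    """Return the MIME type for a JPEG or PNG file name, or None for anything else."""
    return IMAGE_TYPES.get(Path(path).suffix.lower())


def prepare_upload(path, prompt="Help me troubleshoot this."):
    """Bundle an image and a starter prompt into one dictionary.

    The dictionary layout is an assumption made for illustration only;
    a real service defines its own request format.
    """
    mime = image_mime_type(path)
    if mime is None:
        raise ValueError(f"{path} is not a JPEG or PNG; export it as an image first")

    # Base64 is a common way to carry binary image data inside text.
    encoded = base64.b64encode(Path(path).read_bytes()).decode("ascii")
    return {"prompt": prompt, "image": f"data:{mime};base64,{encoded}"}
```

Calling `prepare_upload("error_screenshot.png")` on a file you have saved returns the prompt and the encoded image together, and a PDF raises an error so you know to export a screenshot instead.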

Here’s a video of Gemini helping me troubleshoot a Microsoft Power Automate issue. I uploaded a video and am simply having a conversation to better understand what went wrong and how I could fix it. 

A Picture Is Worth a Thousand Words

These are easy use cases you can try immediately. Testing AI’s limits with images and problem solving is a simple approach that can have very rewarding results. Start with AI!

It’s important to remember that your word choice matters when using AI. Consider these words:

  • Summarize
  • Comprehensive
  • Detail
  • Elaborate
  • Troubleshoot
  • Rewrite
  • Clarify
  • Simplify

New users often haphazardly use these words and don’t get the results they expect or want. While AI is intuitive, it’s also logical. Learning how AI interacts with your prompts, like learning how to work with a colleague, will help you improve your output and productivity.   

By incorporating images into your interactions, you unlock a whole new level of communication and problem-solving potential. Don’t underestimate the power of showing AI what you see. It’s a simple yet incredibly effective way to harness the full capabilities of these remarkable tools.
