AI has been the hype since Chat-GPT commoditized the use of its AI chatbot. Soon after, many AI chatbots followed suit, including Microsoft Copilot, Google Gemini, Anthropic Claude, etc. Many new applications and startups using these AI APIs sprung up as well.
There are a lot of terminologies in play, and it gets confusing very fast. My goal has always been to make complex things easy to understand and break them into small digestible chunks that can be then delved into individually.
The article’s core focus is on making AI accessible and practical for everyone, breaking down complex concepts, and emphasizing that AI is not just another tech hype but a revolutionary technology that everyone should understand. I also list some of the AI tools that I have been playing with that will give you an idea of the leverage you can get with using these tools.
AI is a huge revolution, and it should not be limited to the technical folks but should be understood and leveraged by everyone.
[Photo by Mariia Shalabaieva on Unsplash+]
Getting Started with AI
It is overwhelming to start anything, but getting started in a large and diverse space like AI has been challenging for many. The AI technology space is advancing at an accelerated speed, so the time to get started is now.
👉🏼 I wrote an article Master AI: The Ultimate Learning Plan for Beginners (2024 Guide), that covers some suggestions, ideas, and a structured learning path for AI beginners, covering everything from programming fundamentals to advanced AI technologies.
Is AI just another hype?
I have always been skeptical about hypes and tread carefully around them. It is easy to get lost in the rat race and chase the latest, cool, shiny new thing. I saw a few hype cycles in the 30+ years of my career - E-commerce, SOA (SOAP, XML, 3-tier arch.), Web 2.0 dot com era (blogs, wikis, social networking, crowd-sourcing, RSS, REST, Ajax, web sockets), the Cloud (IaaS, PaaS, SaaS, OpenStack, AWS, Azure, GCP), containers (Docker & Kubernetes), Serverless (Lambda, Serverless Framework), microservices, Web 3.0 (decentralized internet, blockchain, cryptocurrency, smart contracts), the proliferation of JS & UI frameworks, micro-frontends, and now AI…
Not all hype cycles need to be chased or taken off into practical implementations, but the AI revolution that we are in - is a game changer. You should pay attention and carve out the space that is right for you. There is something valuable for everyone, and it is not just any other tech hype. It will be critical for all of us to understand and leverage AI to remain competitive.
💬 Understanding and Analyzing AI Chatbots
It is hard to get one’s head around this space, so I wrote an article about comparing Chat GPT, Gemini, Copilot and Claude AI Chatbots. The article covers the basics of what these AI chatbots can do and describes the criteria to evaluate these chatbots based on one’s individual needs. It also covers a high-level comparison of AI chatbots, with a detailed analysis of the strengths and weaknesses of each, highlighting the comparison criteria & their Impacts. It also compares the image-generation capabilities, use cases & best-case scenarios, and future developments & trends in the AI chatbots.
The AI Tooling Space
🎨 Authoring & Image Generation Tools
Many AI image-generation tools and platforms listed below have been pioneering the Generative AI space.
The big players and popular AI tools:
- Sora: Open AI’s latest platform to generate entirely new videos from text, bring images to life, text to video, image to video, video to video, or extend, remix, or blend videos you already have. New interfaces that allow easier prompting, creative controls, and community sharing. Videos up to 1080p and 20s long, in widescreen, vertical, or square. Available as part of the Plus subscription or with 10x more usage/higher resolution as part of a Pro subscription.
- DALL-E 3: Modern AI text-to-image generator. Is available for all ChatGPT users. API support.
- Midjourney: AI image generator via prompts.
- Adobe Firefly: Firefly models and services power generative AI features in Adobe creative apps.
- Microsoft Designer: Create, design, and edit anything you can imagine with AI.
- Imagen 3: Imagen 3 is Google’s highest-quality text-to-image model, capable of generating images with even better detail, richer lighting , and fewer distracting artifacts than our previous models. Imagen 3 is deployed with the latest privacy, safety, and security technologies, including our innovative watermarking tool SynthID — which embeds a digital watermark directly into the pixels of the image, making it detectable for identification but imperceptible to the human eye.
- Genie 2: Genie 2 is Google’s revolutionary foundation model capable of generating fully interactive, playable 3D environments—no coding required.
The other popular AI tools:
- WOMBO Dream: Enter a prompt, pick an art style and WOMBO Dream turns your idea into an AI-powered painting in seconds.
- OpenArt.ai: OpenArt is an AI-powered art platform where users can create images using pre-built models or by training their own models. A suite of AI tools is provided to enhance the creation of the artwork.
- StableDiffusion (DreamStudio): Use the Stable Assistant chatbot to generate and edit images and access Stability AI’s latest tools.
- RunwayML: AI Art Generator
- Leonardo.ai: Generative AI with a unique set of suite of tools for art, image, video, graphic design, and marketing.
- Khroma: Creates infinite color palettes according to your prompts.
- Ideogram: Ideogram is a free-to-use AI tool that generates realistic images, posters, logos and more.
- UIzard: Scans your sketches and turns them into editable, testable digital designs.
- Artbreeder: Create characters, artworks, and more with multiple tools, powered by AI. Use Composer to create by mixing images and text together. Use Collager to create images out of simple shapes, images, and text.
- Jasper Art: Create stunning, on-brand visuals. Remove or replace backgrounds, uncrop images, upscale images, remove text, and reimagine composition, setting, or subject. API support.
- NightCafe: Create amazing artwork using the AI Art Generator.
- Fontjoy: Speeds up the process of font pairing.
- Pika: Transform photos with signature effects.
- Descript: Descript is the AI-powered, fully featured, end-to-end video editor. Free tier available.
- Black Forest Labs: Use FLUX.1 Tools, a suite of models designed to add control and steerability to our base text-to-image model FLUX.1, enabling the modification and re-creation of real and generated images.
I want to call out my favorites based on their awesomeness 🤯 factor:
- Kling AI: AI Creative studio for images and videos. Check out Motion Brush. Precise control of movements. End Frame. Seamless transitions between images. Camera Movement. Smooth & steady camera movements. Lip Sync.
- Copy.ai: Creates different types of copy based on being prompted with the brand, product name, and description.
- Canva Magic Design: Magic Studio brings together the best AI-powered tools for you and your team, right inside Canva. Use Magic Design to create custom, on-brand, and attention-grabbing content. Use Dream Lab to create stunning, production-quality visuals. Use Magic Resize to swap formats, languages, and dimensions in a snap. Use Magic Media, to dream up and then add it to your design. Other AI tools include Magic Expand, Magic Grab, Magic Edit, Magic Eraser, Background Remover, Magic Animate, Magic Morph, and Magic Write.
- Google NotebookLM: It helps you make sense of complex information. When you upload your sources, it instantly becomes an expert, grounding its responses in your material with citations and relevant quotes. And since it’s *your* notebook, your personal data is never used to train NotebookLM. Audio Overview turns your documents into engaging audio discussions and a podcast. Two AI hosts start up a lively “deep dive” discussion based on your sources. They summarize your material, make connections between topics, and banter back and forth. You can even download the conversation and take it on the go.
And, then some of the services that are cool but I have security concerns about:
- Diffuse (Higgsfield AI): Diffuse is an AI-powered mobile app for personalized entertainment. Watch yourself star in your own show, take personality quizzes with AI-powered results, use one of our 1000s of templates to create fun and hilarious videos, or create original scenes from scratch using our text-to-video model.
- Captions: Generate and edit talking videos with AI. Create 3D avatars. Translate your voice into 28+ languages with synced lip movement to match.
- HeyGen: HeyGen is an innovative video platform that harnesses the power of generative AI to streamline your video creation process. Create and translate videos with AI. Produce studio-quality videos in 175 languages without a camera or crew. Write your script, and watch an avatar read it flawlessly in one take.
🔐 Security Concern: Does uploading your selfie, picture, or voice to an online service put you at risk of them being used in other deepfakes?
💻 App Development Tools
No code, low code, and AI-assisted full-stack app development tools.
- Bubble.io: The full-stack, no-code app builder. Quickly design and launch scalable applications with our AI-powered no-code development platform. Instantly connect to an LLM using a plugin or API key. Hundreds of customizable Bubble templates and reusable components. Use the drag-and-drop editor to arrange them.
- Replit: Create beautiful, modern web applications. Describe what you need, and Replit’s AI Agent builds it for you. With its code editor built for collaboration, you can write, review, and debug code. Code together in real-time with live cursors. Its cloud environments and flexible hosting for any type of project allow easy development and deployment. It supports popular frameworks that are built-in with their automatic package installer. No downloads. No config. No setup.
- Bolt.new: Prompt, run, edit, and deploy full-stack web apps. Start from a custom prompt or start a blank app with your favorite stack.
- Writer: With AI Studio, build AI apps and workflows. Palmyra, a family of LLMs, is purpose-built for the enterprise to give you quality and control, without high costs. Knowledge Graph, a graph-based RAG, anchors generative AI in your company data. Enforce your regulatory, legal, inclusivity, and brand rules across all work, whether it’s created by your people or AI.
- BoltAI: Provides all AI models in one app. You can switch between top AI services and local models. You can do all this from a single native app on your Mac. Features include an interactive chat UI, Prompt library, AI Commands, AI Inline, and an AI Coding Assistant within your IDE, and can generate high-quality posts. Get instant access to large language models and Stable Diffusion in one tool.
✨ One special mention of an AI code assistant that I regularly use is Cloving.ai. Cloving.ai is a cutting-edge, open-source command line tool that seamlessly integrates AI into the developer workflow, enhancing productivity and code quality. If you are a fan of integrating AI into your development workflow and working on apps interactively, check out Cloving.ai.
👉🏼 Check out my article on how to build a web app using Cloving AI without the prompt engineering headaches.
🎉 I recently experimented with Bolt.new AI assistant and was blown away by its code assistance features. I built a Todo List app from scratch by just providing prompts to Bolt.new.
👉🏼 Check out my article: Exploring AI Assisted Development: Building a Feature Rich Web App With Bolt.new.
👩🏼💻 AI Code Assistants
AI code assistants use AI to generate code and help with various coding and programming-related tasks. They can analyze code, provide suggestions, provide solutions to bugs, and even generate code based on prompts. They are often embedded within IDEs and support a wide variety of programming languages.
Here are a few of the popular AI code assistants that let you iteratively work on your code base:
- GitHub Copilot
- Microsoft IntelliCode
- Codeium (Windsurf): Either use the Windsurf Editor, an AI IDE and code editor, or install the Codeium extension in your favorite code editors. Allows developers to build entire applications, predict intent with fast autocomplete, and provide in-line code generation and natural language instructions in the terminal.
- Cursor: An AI code editor that lets you breeze through changes by predicting your next edit. Get answers from your codebase or refer to files or docs. It lets you write code using instructions and update entire classes or functions with a simple prompt. With Privacy Mode, your code is only stored locally.
- V0: Chat with v0 by Vercel. Generate UI with simple text prompts. Copy, paste, ship. Iterate on visual components with live preview. The downside is that it can’t develop full apps but I use it for individual component development, quick mock-ups, or prototypes.
- Tabnine: An AI code assistant that helps you ship higher quality software faster with AI tools for code generation, testing, and code review custom-fit to each engineering team. From code creation & explanations, to test and doc generation & bug fixes, to faster app development. Complete code privacy with zero data retention.
- Cline: An AI assistant that can use your CLI aNd Editor. Cline can handle complex software development tasks step-by-step, create & edit files, explore large projects, use the browser, and execute terminal commands and can assist in ways that go beyond code completion or tech support.
Apart from coding assistants, there are enterprise-focused task-specific assistants like Harness’ AI DevOps Engineer and QA Assistant for pipeline generation and test automation, and AI agents like All Hands that can perform more end-to-end software development.
🗣️ Support Chatbot Agents
Another use case for AI that has been very useful is the support of chatbot agents. These agents can directly interact with end-user customers and/or support contact center agents to help them.
Aisera, Decagon, and Sierra’s agents interact directly with end customers, while Observe AI supports contact center agents with real-time guidance during calls.
There are solutions like Glean and Sana that can connect to emails, messengers, and document stores and thus enable unified semantic search across disparate systems and deliver AI-powered knowledge management.
💡Check out the 2024 The State of Generative AI in the Enterprise report.
🪝So what’s the catch?
Well, it is not as simple to use these AI tools. Software development is a thoughtful, creative, and complex process. Experienced engineers can do wonders with these AI tools to augment their already creative side. Always remember that the AI tools are just an “assistant” and cannot replace the experience of a hardened engineer. It can, however, accelerate the speed of app development and automate mundane repetitive tasks.
It might seem like magic but experienced engineers review and refactor the code that is generated by the AI tools. They might add error handling, logging, and instrumentation to make the code run efficiently in production. They might add more test cases to the basic unit and integration tests generated by AI. They can foresee potential gaps, issues, and edge cases that the AI missed. They can look beyond the simplistic code generated by the AI tools and enforce best practices, standards & architectural patterns that are internal to their organization.
Be cautious of what code you “accept”, and make sure you have a robust code review process for reviewing code generated by AI. Make sure what code you let the AI tool access so that your IP is not being leaked by the AI tools that you are using.
👉🏼 Bottomline: No! - developers are not becoming redundant, but basic, routine & mundane programming tasks might be automated with the help of AI tools. A quick prototype, boilerplate app starters, UI and styling assist, code for a complex algorithm, assisted code completion, suggested code blocks, and syntax checks - will give you a head start and a quick boost, but you need engineers to drive the app home to production.
Summary
While the AI space is overwhelming and rapidly advancing, the time to get started is now - waiting is not an option.
Overview & Context
- AI is not just another tech hype - it’s a game-changing, revolutionary technology that everyone needs to understand and leverage to remain competitive
- The AI technology space is advancing rapidly, making it crucial to get started now despite its overwhelming nature
AI Tools Landscape
- Major AI chatbots include ChatGPT, Gemini, Copilot, and Claude, each with unique strengths in areas like NLP, coding, and multimodal abilities
- Image generation is led by tools like DALL-E 3, Midjourney, Adobe Firefly, and Google’s Imagen 3, offering various capabilities for creative work
- Notable AI development tools include AI code assistants like GitHub Copilot and Codeium, plus no-code platforms like Replit, Bubble.io, BoltAI, and Cloving.ai for app development
- AI code assistants like GitHub Copilot, Codeium, and Cursor help developers with code generation, suggestions, and bug fixes
Enterprise Applications
- AI-powered support chatbots and agents are transforming customer service and contact center operations
- Enterprise solutions like Glean and Sana enable unified semantic search and AI-powered knowledge management across various systems
Important Considerations
- Security concerns exist with certain AI tools, particularly those requiring uploads of personal data like selfies or voice recordings
- While AI tools are powerful assistants, they cannot replace experienced engineers who provide crucial oversight, error handling, and architectural expertise
- Organizations must maintain robust code review processes and protect their IP when using AI tools in development
If you have questions or feedback, please let me know in the comments below.