Awesome Generative AI

A curated list of modern Generative Artificial Intelligence projects and services.

Generative Artificial Intelligence is a technology that creates original content such as images, sounds, and texts by using machine learning algorithms that are trained on large amounts of data. Unlike other forms of AI, it is capable of creating unique and previously unseen outputs such as photorealistic images, digital art, music, and writing. These outputs often have their own unique style and can even be hard to distinguish from human-created works. Generative AI has a wide range of applications in fields such as of art, entertainment, marketing, academia, and computer science.

Contributions to this list are welcome. Before submitting your suggestions, please review the Contribution Guidelines to ensure your entries meet the criteria. Add links through pull requests or create an issue to start a discussion. More projects can be found in the Discoveries List, where we showcase a wide range of up-and-coming Generative AI projects.

Text

Chatbots

ChatGPT

Conversational AI system by OpenAI based on their GPT models.

OpenAI

Claude

Anthropic's AI assistant trained to be helpful, harmless, and honest.

Anthropic

Gemini

Google's family of multimodal large language models.

Google

Perplexity

AI-powered search and discovery tool that provides answers based on real-time web information.

Perplexity AI

Pi

Personal AI assistant from Inflection AI, designed for natural, supportive conversations.

Inflection AI

Character.AI

AI characters that can engage in roleplay and discussion with users.

Character.AI

Text Generation

GPT-4

OpenAI's large multimodal model capable of solving difficult problems with greater accuracy.

OpenAI

LLaMA

Meta's family of foundational large language models.

Meta Open Source

Mistral

Open models with competitive performance and commercial usage freedom.

Mistral AI Open Source

Claude

Anthropic's AI assistant that can handle complex tasks while being helpful, harmless, and honest.

Anthropic

Cohere

Models specialized for enterprise use, focusing on text generation, search, and chatbots.

Cohere

Jasper

AI content platform for marketing teams to create brand-compliant content.

Marketing

Research Papers and Summaries

Elicit

AI research assistant that helps find papers, extract insights, and summarize findings.

Research

SciSpace

Platform to discover, read, and understand scientific literature more effectively.

Research

Scholarcy

Reads research articles and creates summaries of key findings.

Research

Semantic Scholar

AI-powered research tool to help scholars discover and understand scientific literature.

Research

Connected Papers

Visual tool to explore academic papers and their connections.

Research

Coding

GitHub Copilot

AI pair programmer by GitHub and OpenAI that suggests code completions.

GitHub OpenAI

Amazon CodeWhisperer

AWS's AI code generator that offers real-time suggestions.

AWS

Cursor

AI-first code editor with GPT-4 built in for coding assistance.

IDE

Codeium

Free AI coding assistant with autocomplete, chat, and search features.

Free

Replit Ghostwriter

AI pair programmer integrated into the Replit IDE.

Replit

Tabnine

AI code completion tool supporting multiple programming languages and IDEs.

Code Completion

CodeGeeX

Multilingual code generation model with 13B parameters.

Open Source

CodeT5

Open-source code generation tool by Salesforce.

Salesforce Open Source

WPCode

AI code generator for WordPress focused on snippets and customizations.

WordPress

Continue

Open-source coding assistant that runs on your laptop.

Open Source Local

Comparing AI Coding Assistants

As AI coding tools continue to evolve, developers now have multiple options to choose from. Here's how some of the leading tools compare:

Tool Best For Pricing Key Features
GitHub Copilot Professional developers $10/month or $100/year Integrated with VS Code, JetBrains IDEs, and Neovim; Advanced code completion; Context-aware suggestions
Amazon CodeWhisperer AWS developers Free for individual use Integrated with AWS Cloud9; Security scanning; Code references
Cursor Fast prototyping Free, Pro: $20/month Built on VS Code; Chat interface; GPT-4 integration
Codeium Individual developers Free for individuals Supports 20+ languages; IDE integrations; Fast autocomplete

Agents

What are AI Agents?

AI agents are autonomous systems that can perceive their environment, make decisions, and take actions to achieve specific goals. Unlike regular AI models that respond to single prompts, agents can perform complex sequences of tasks, reason about information, and use tools to accomplish objectives with minimal human intervention.

AutoGPT

Open-source autonomous GPT-4 agent.

Open Source GPT-4

BabyAGI

AI-powered task management system running on OpenAI and Pinecone.

Open Source Task Management

AgentGPT

Autonomous AI agents accessible through a web browser.

Web-based

Langchain

Framework for building applications powered by language models.

Open Source Framework

E2B

Infrastructure and tools for AI agents, providing sandboxed cloud environments.

Infrastructure

HuggingGPT

Microsoft's open-source platform connecting LLMs with ML models.

Open Source Microsoft

OpenDevin

Open-source autonomous software engineer.

Open Source Coding

Smol Developer

Coding agent for generating entire projects from prompts.

Open Source Coding

GPT Engineer

Agent that generates code for entire software projects.

Open Source Coding

MetaGPT

Multi-agent framework for complex task solving.

Open Source Framework

How AI Agents Work

AI Agent Architecture

Diagram showing the basic architecture of an AI agent system with planning, reasoning, and execution components.

Image

DALL-E 3 Image

DALL·E 3

OpenAI's latest text-to-image model

Midjourney Image

Midjourney

Known for artistic quality and stylization

Stable Diffusion Image

Stable Diffusion

Open-source diffusion model

AI Art Generation

DALL·E 3

OpenAI's text-to-image model known for photorealistic results.

OpenAI

Midjourney

Independent AI art generator known for artistic quality.

Discord

Stable Diffusion

Open-source text-to-image model by Stability AI.

Open Source

Firefly

Adobe's family of generative AI models for creative use.

Adobe

DreamStudio

User-friendly interface for Stable Diffusion.

Stability AI

Leonardo.Ai

AI art generation platform focused on creative asset creation.

Assets

Image Editing and Enhancement

Photoshop AI Features

AI tools in Photoshop including Generative Fill.

Adobe

Luminar Neo

Photo editor with AI-driven features like sky replacement.

Photo Editing

Topaz Photo AI

AI-powered image enhancement focusing on noise reduction and upscaling.

Enhancement

Let's Enhance

AI image upscaling and enhancement service.

Upscaling

Gigapixel AI

AI image enlargement tool.

Upscaling

Clipdrop

Suite of AI image editing tools including background removal and upscaling.

Multiple Tools

Effective Prompting for AI Image Generation

Creating effective prompts is key to getting the best results from AI image generators. Here are some tips to improve your prompts:

Prompt Structure Elements

  • Subject: Clearly define what you want as the main focus (person, object, scene)
  • Setting: Describe the environment or background
  • Style: Specify artistic styles (e.g., photorealistic, watercolor, anime)
  • Lighting: Describe the lighting conditions (e.g., golden hour, dramatic, soft)
  • Composition: Indicate framing, perspective, or camera angle

Example Prompt

"A serene mountain lake at sunrise, with a small wooden cabin on the shore, mist rising from the water, ultra-detailed digital painting, dramatic lighting, in the style of Thomas Kinkade, 4K resolution"

Video

Sora by OpenAI

Text-to-video AI model

RunwayML Gen-2

Text and image to video generation

Video Generation

Sora

OpenAI's text-to-video model capable of generating realistic and imaginative scenes.

OpenAI

RunwayML Gen-2

AI video generation model supporting text-to-video, image-to-video, and customization options.

Runway

Pika

AI video generator with text-to-video, image-to-video, and video editing capabilities.

Creative

Stable Video Diffusion

Open source text-to-video and image-to-video model by Stability AI.

Audio

Generative AI is revolutionizing audio production across music, voice synthesis, sound effects, and audio enhancement. From creating original compositions to cloning voices and restoring audio quality, these tools are transforming how we create and interact with sound.

Music Generation

Suno

AI music generator creating fully-produced songs from text prompts.

Music

Udio

Google's AI music generation platform (formerly MusicLM).

Google

Soundraw

AI music generator creating royalty-free music for content creators.

Royalty-free

Mubert

AI-powered royalty-free music streaming and generation platform.

Streaming

AIVA

AI composer for emotional soundtracks for games, films, and content.

Soundtrack

AudioCraft

Meta's open-source library for audio generation with models like MusicGen.

Open Source

Voice Synthesis and Cloning

ElevenLabs

AI voice cloning and speech synthesis with high-fidelity output.

Voice

Play.ht

AI voice generator with 900+ voices in 142+ languages with emotion control.

Voice

Resemble AI

Voice cloning platform for enterprise applications.

Enterprise

Descript Overdub

Voice cloning technology integrated into Descript's audio/video editor.

Editor

WellSaid Labs

Text-to-speech platform for learning and marketing content.

Professional

Coqui TTS

Open-source deep learning toolkit for Text-to-Speech.

Open Source

Audio Editing and Enhancement

Adobe Podcast (formerly Enhance Speech)

AI-powered tool to enhance speech recordings and remove background noise.

Adobe

Audacity AI Tools

AI-powered audio enhancement tools in the free Audacity editor.

Free

Descript Clarity

Background noise and reverb reduction tool in Descript.

Editor

iZotope RX

Professional audio repair and enhancement suite with AI-powered tools.

Professional

Podcastle

All-in-one platform for podcast recording and editing with AI enhancement.

Podcast

Facebook Denoiser

Open-source real-time audio denoising library.

Open Source

Ethical Considerations in AI Audio

The rise of AI audio tools, particularly voice synthesis and cloning, raises important ethical concerns that creators and users should be aware of:

  • Consent: Always obtain explicit permission before cloning someone's voice.
  • Impersonation: Voice cloning can be misused for fraud, misinformation, or impersonation.
  • Transparency: Disclose when AI-generated voices are used in content.
  • Copyright: Consider rights implications when generating music similar to existing artists.
  • Watermarking: Support efforts to watermark AI-generated audio for detection.

Many reputable AI audio companies have implemented safeguards and terms of service to prevent misuse, but responsible usage remains the responsibility of creators.

Other

Beyond text, image, video, and audio, generative AI is making significant impacts across various other domains and applications. This section explores innovative use cases and specialized applications of generative AI technology.

3D and Game Development

Luma AI

Create 3D objects and scenes from text or images with NeRF technology.

3D

Getimg.ai 3D

Text-to-3D model generator for creating 3D assets.

3D

Blockade Labs

Creator of Skybox AI for generating immersive 3D worlds from text.

VR/AR

NVIDIA Omniverse

Platform for 3D workflows with AI generation capabilities.

NVIDIA

Scenario

AI-powered game asset generation platform.

Game Dev

AI Generated

Platform for discovering and downloading AI-generated game assets.

Game Dev

Science and Medicine

AlphaFold

DeepMind's AI system for predicting protein structures.

Google

Insilico Medicine

AI-driven drug discovery and development platform.

Medicine

Atomwise

AI for structure-based small molecule drug discovery.

Drug Discovery

DeepCell

AI platform for cell classification and analysis.

Cell Biology

Isomorphic Labs

DeepMind spinoff applying AI to drug discovery.

Google

GenerativeBio

Open-source tools for generative biology research.

Open Source

Multimodal Applications

GPT-4o

OpenAI's multimodal model that can process text, images, audio, and video inputs.

OpenAI

Gemini

Google DeepMind's multimodal AI model that works across text, images, audio, video, and code.

Google

Claude 3

Anthropic's multimodal AI assistant that handles text and vision inputs.

Anthropic

Perplexity

AI search engine that can interpret and respond to text and image queries.

Search

LLaVA

Open-source multimodal model for visual and language understanding.

Open Source

MiniGPT-4

Open-source implementation of vision-language capabilities.

Open Source

Learning Resources

This section compiles high-quality resources for learning about generative AI, from beginner-friendly courses to advanced research papers. Whether you're just getting started or looking to deepen your expertise, these resources will help you understand the technology behind generative AI.

Courses and Tutorials

ChatGPT Prompt Engineering for Developers

Course by DeepLearning.AI and OpenAI on prompt engineering techniques.

Free Beginner

Practical Deep Learning for Coders

Fast.ai's course covering deep learning fundamentals with practical applications.

Free Intermediate

Generative AI Specialization

DeepLearning.AI course covering the full spectrum of generative AI models and applications.

Paid Beginner

Hugging Face NLP Course

Learn how to use transformers for natural language processing tasks.

Free Intermediate

Andrej Karpathy's Neural Networks Zero to Hero

YouTube tutorial series covering neural networks from basic principles to advanced models.

Free Intermediate

Stanford CS324: Large Language Models

Stanford course covering foundations, capabilities, and limitations of LLMs.

Free Advanced

Books and Publications

Deep Learning

Comprehensive textbook by Ian Goodfellow, Yoshua Bengio, and Aaron Courville.

Free Advanced

Deep Learning for Coders with fastai and PyTorch

Practical guide to deep learning by Jeremy Howard and Sylvain Gugger.

Free Intermediate

GPT-4 Technical Report

OpenAI's technical report on GPT-4's capabilities and limitations.

Free Advanced

Attention Is All You Need

Landmark paper introducing the Transformer architecture that powers modern AI.

Free Advanced

Generative Deep Learning

Book by David Foster on building and training generative models.

Paid Intermediate

Language Models are Few-Shot Learners

GPT-3 paper introducing few-shot learning capabilities in large language models.

Free Advanced

Communities and Forums

Hugging Face Community

Platform to share and discover machine learning models, demos, and applications.

Free

r/MachineLearning

Reddit community for machine learning discussions and news.

Free

Midjourney Discord

Community for AI image generation and discussions.

Free

PyTorch Forums

Discussion forums for PyTorch users and developers.

Free

Kaggle

Platform for data science competitions, notebooks, and discussions.

Free

GitHub Generative AI Communities

Open-source communities and repositories focused on generative AI.

Free

More Lists

Looking for more curated resources on generative AI and related topics? Check out these other awesome lists that provide specialized collections in various domains.

Awesome Generative AI

A curated list of modern Generative Artificial Intelligence projects and services.

GitHub

Awesome LLM

Curated list of Large Language Models resources, focusing on open-source alternatives.

GitHub

Awesome ChatGPT

Curated list of ChatGPT resources, tools, and applications.

GitHub

Awesome Generative AI Guide

A beginner-friendly guide to generative AI tools and frameworks.

GitHub

Awesome AI Agents

Resources for building autonomous AI agent systems.

GitHub

Awesome Stable Diffusion

Curated list of Stable Diffusion resources, models, and tools.

GitHub

This list is maintained by the community. Contributions are welcome!

Back to Top

© 2024 Awesome Generative AI | Licensed under CC BY-SA 4.0