Skip to content

Pentagram is a web app for generating images from text prompts with social interaction features.

Notifications You must be signed in to change notification settings

RaoAkif/pentagram

Repository files navigation

Headstarter Banner

Pentagram: Realtime Image Diffusion

Pentagram is a web app that enables users to generate images from text prompts using a self-hosted image generation model, like Stable Diffusion, on serverless GPUs. It offers an Instagram-like experience, where users can create, like, comment, and share images, all while ensuring low-latency performance for a smooth interaction.

Project Overview

In this project, you will build a web app where users can generate images using text prompts via an image generation model that you host on serverless GPUs.

Technologies Used:

  • React
  • TypeScript
  • Modal
  • APIs
  • Authentication
  • Hardware Infrastructure

Project Concept:

Instead of users uploading their own pictures, they can generate images with text prompts. You'll host an image generation model (e.g., Stable Diffusion) yourself on serverless GPUs, ensuring low-latency performance for a smooth user experience. This app will have an Instagram-like interface, where users can interact socially through likes, comments, and sharing features.


Getting Started

1. Clone the Repository

First, clone the GitHub repository:

git clone https://github.com/team-headstart/pentagram.git

Navigate to the project directory:

cd pentagram

Install the dependencies:

npm install

Run the development server:

npm run dev

Open http://localhost:3000 with your browser to see the result.


Tasks

Important Files to Review

  • src/app/page.tsx:
    This is where the user can input their prompt and generate an image. Ensure that you update the UI and handle the API response to display the generated images.

  • src/app/api/generate-image/route.ts:
    This is where the image generation API is implemented. Make sure to call your image generation API hosted on Modal here.


Project Requirements

  • Host an Image Generation Model on Serverless GPUs:
    Host a model like Stable Diffusion on Modal, ensuring low-latency performance for a smooth user experience.

  • Web App Features:
    Create a web app that allows users to generate images from text prompts, manage their creations, and interact socially with features like likes, comments, and sharing.

  • UI/UX and Image Management:
    Build an intuitive UI/UX with proper authentication and efficient image management. Include a prompt history to enhance the user experience.


Resources

Here are some useful resources to get started:


Learn More

To learn more about Next.js, take a look at the following resources:


Authors

👤 Rao Akif


Contributing

Contributions, issues, and feature requests are welcome! Check the issues page for ongoing issues.


Acknowledgments

  • Inspiration from image generation platforms like Midjourney and Stable Diffusion.
  • Thanks to Modal for providing serverless GPU hosting.

License

This project is licensed under the MIT License. See the LICENSE file for details.


This update brings the README in line with the specific requirements for hosting image generation models and handling user interactions through a web app.

About

Pentagram is a web app for generating images from text prompts with social interaction features.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published