Skip to content

acm-projects/docVault

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

3 Commits
Β 
Β 

Repository files navigation

docVault

πŸ“‚ DocVault: A Secure Digital Document Management Solution

πŸ“Œ Overview

DocVault is a secure and user-friendly digital document management solution designed for individuals, families, and businesses. It centralizes critical files like IDs, financial statements, contracts, and medical records, ensuring secure storage, efficient organization, and easy accessibility.

The core feature of DocVault is a Chrome extension that categorizes and stores downloaded documents automatically using content and metadata analysis. Additionally, it integrates AWS Textract for text extraction, AWS Translate for document translation, and AI-driven guidance for form completion and document-related queries.

✨ Features

  • Chrome Extension: Automatically categorizes and stores documents upon download.
  • AWS Textract: Extracts text from documents for content analysis.
  • Automatic Categorization: Organizes documents into relevant folders (Financial, Legal, Personal, etc.).
  • AWS Translate Integration: Translates non-English documents into English.
  • AI Chatbot: Answers document-related queries and assists with form completion.
  • Secure Cloud Storage: Documents stored safely in Amazon S3 with easy retrieval.

πŸ›  Minimum Viable Product (MVP)

Chrome Extension

βœ… Detects when a document is downloaded.
βœ… Extracts content and categorizes it automatically using predefined patterns (e.g., "Invoice", "Driver's License").

Backend System

βœ… Secure document storage in AWS S3.
βœ… AWS Textract for extracting text and categorizing documents.
βœ… Metadata storage using AWS DynamoDB.

Document Translation

βœ… Uses AWS Translate to translate non-English documents and store them appropriately.

AI Integration

βœ… AI chatbot answers questions related to stored/downloaded documents (e.g., "When does my passport expire?").
βœ… Assists users in filling out forms.

Web Interface

βœ… Allows users to log in, view, search, and manage stored documents.


πŸš€ Stretch Goals

πŸ’‘ Advanced Document Classification: Machine learning-based classification for improved accuracy.
πŸ’‘ Secure Document Sharing: Role-based access control for sharing documents.
πŸ’‘ OCR for Scanned Documents: Extract text from scanned PDFs/images.
πŸ’‘ Cross-Browser Support: Extend the extension to Firefox and Safari.
πŸ’‘ Mobile App: React Native or Flutter app for on-the-go document management.
πŸ’‘ Document Annotation: Enable users to annotate and comment on stored documents.


Milestones

Week 1-2: Project Planning & Setup
  • Define project goals and milestones
  • Set up GitHub repository
  • Finalize tech stack
  • Establish development environment
  • Assign team roles
Week 3-4: Chrome Extension Development & AWS S3 Integration

Frontend:

  • Develop initial Chrome extension UI
  • Implement basic user interactions

Backend:

  • Integrate AWS S3 for document storage
  • Implement initial categorization features
Week 5-6: Backend Development & Authentication

Frontend:

  • Set up UI components for authentication
  • Connect frontend to backend API

Backend:

  • Build API Gateway, Lambda functions, and DynamoDB integration
  • Implement user authentication system
  • Deploy backend infrastructure
Week 7-8: AI Chatbot & AWS Translate Integration

Frontend:

  • Enhance UI for chatbot interactions
  • Improve document categorization UI

Backend:

  • Implement AI chatbot for document-related queries
  • Improve document categorization using AI
  • Integrate AWS Translate for multilingual support
Week 9: Testing Phase
  • Conduct unit testing
  • Perform integration testing
  • Conduct user testing and gather feedback
Week 10: Final Refinements & Documentation
  • Fix bugs and polish UI
  • Finalize documentation
  • Prepare for presentation

πŸ— Tech Stack

Frontend

  • Next.js – Web interface for document management.
  • Chrome Extension API – Handles document downloads.

Backend

  • AWS API Gateway – API handling.
  • AWS S3 – Secure storage for categorized documents.
  • AWS Lambda – Handles document processing.
  • AWS Textract – Extracts text for document categorization.
  • AWS Translate – Translates documents into English.
  • AWS DynamoDB – Stores document metadata.

Authentication

  • Amazon Cognito – User authentication and authorization.

πŸ† Competition

Competitor Challenges
Google Drive/OneDrive Requires manual document organization.
DocuSign No automatic document categorization.
Evernote/Notion Lacks AI-powered document management.

βœ… DocVault Advantage: Automation, AI assistance, secure cloud storage.


🚧 Challenges & Solutions

🚨 Many features to complete in 10 weeks β†’ Break tasks into small components and refine iteratively.
🚨 Beginners unfamiliar with AI integration β†’ Start researching early, integrate AI after core components.
🚨 Handling both extension & web app β†’ Keep UI simple and intuitive.


πŸ“š Additional Resources

Chrome Extension Development

AWS Services

Development Tools

  • GitHub Docs
  • Visual Studio Code
  • Postman

Developers πŸ‘₯

  • Aima Salman
  • Noah Flores
  • Nicholas Remack
  • Sreenivasa Sobhirala

Project Team

  • Nadeeba Atiqui - Project Manager
  • Joanna Borba - Industry Mentor

πŸš€ Get Started with DocVault Development

Clone the repository and follow the setup instructions to contribute!

  git clone https://github.com/your-repo/docvault.git
  cd docvault
  npm install
  npm run dev

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published