Enhanced Content Management and AI Integration #84

Merged (31 commits) on Sep 9, 2024

zhuojg (Contributor) commented on Aug 22, 2024

Key Features

ContentBase Framework

Introducing a new series of crates to replace file-handler:

GenDAM-specific:

  • content-base: Core functionality for GenDAM
  • content-base-pool: Task pool implementation for GenDAM

General-purpose:

  • content-base-context: Task execution context
  • content-base-task: File-related task definitions
  • content-handler: File processing logic
  • content-metadata: File metadata definitions

Expanded File Type Support

  • Added support for video, audio, image, raw text, and web pages (direct URL import)
  • Implementation: content-base (Rust-side)
  • Enhanced TS types for improved Developer Experience
  • Refined asset metadata handling:
    • Primary check: File extension
    • Secondary check: File magic number (if type is indeterminate)
  • Streamlined asset metadata storage in database:
    • Removed MediaData model
    • Direct string storage of metadata in AssetObject
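
The two-step type check described above (extension first, magic number only when the extension is inconclusive) could be sketched as follows. This is a minimal illustration, not the code in content-base: the `ContentKind` enum, the function names, and the short extension and signature tables are all assumptions made for the example.

```rust
use std::path::Path;

/// Coarse content categories (hypothetical enum, named after the types in this PR).
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum ContentKind {
    Video,
    Audio,
    Image,
    RawText,
    Unknown,
}

/// Primary check: map the file extension to a kind (illustrative, partial table).
fn kind_from_extension(path: &Path) -> ContentKind {
    let ext = path
        .extension()
        .and_then(|e| e.to_str())
        .map(|e| e.to_ascii_lowercase());
    match ext.as_deref() {
        Some("mp4" | "mov" | "mkv") => ContentKind::Video,
        Some("mp3" | "wav" | "flac") => ContentKind::Audio,
        Some("png" | "jpg" | "jpeg" | "gif" | "webp") => ContentKind::Image,
        Some("txt" | "md") => ContentKind::RawText,
        _ => ContentKind::Unknown,
    }
}

/// Secondary check: inspect the leading bytes for a few well-known signatures.
fn kind_from_magic(head: &[u8]) -> ContentKind {
    // PNG, JPEG, and GIF signatures.
    if head.starts_with(&[0x89, b'P', b'N', b'G', 0x0D, 0x0A, 0x1A, 0x0A])
        || head.starts_with(&[0xFF, 0xD8, 0xFF])
        || head.starts_with(b"GIF8")
    {
        return ContentKind::Image;
    }
    // RIFF containers carry their subtype at bytes 8..12.
    if head.len() >= 12 && &head[..4] == b"RIFF" {
        if &head[8..12] == b"WAVE" {
            return ContentKind::Audio;
        }
        if &head[8..12] == b"WEBP" {
            return ContentKind::Image;
        }
    }
    // FLAC stream marker, or an MP3 file that begins with an ID3 tag.
    if head.starts_with(b"fLaC") || head.starts_with(b"ID3") {
        return ContentKind::Audio;
    }
    // ISO base media files (e.g. MP4) have "ftyp" at bytes 4..8.
    if head.len() >= 8 && &head[4..8] == b"ftyp" {
        return ContentKind::Video;
    }
    ContentKind::Unknown
}

/// Extension first; fall back to the magic number only if the type is indeterminate.
pub fn detect_kind(path: &Path, head: &[u8]) -> ContentKind {
    match kind_from_extension(path) {
        ContentKind::Unknown => kind_from_magic(head),
        kind => kind,
    }
}
```

A caller would read the first few bytes of the file and pass them as `head`, for example `detect_kind(Path::new("upload.bin"), &buf[..n])`. Checking the extension first keeps the common case cheap, since no file I/O is needed when the extension is already conclusive.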

RAG-based Copilot

  • Leveraging Large Language Models (LLMs) for answer generation using retrieved assets
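
To make the retrieval-augmented step concrete, here is a rough sketch of how retrieved assets might be turned into an LLM prompt. It assumes the retrieval itself (for example a vector search) has already produced scored hits. The `RetrievedAsset` struct, its fields, `build_prompt`, and the prompt wording are hypothetical and are not taken from this PR.

```rust
/// One retrieved asset snippet with its similarity score (hypothetical shape).
#[derive(Debug, Clone)]
pub struct RetrievedAsset {
    pub asset_id: String,
    pub snippet: String,
    pub score: f32,
}

/// Keep the `top_k` highest-scoring hits and assemble a grounded prompt.
pub fn build_prompt(question: &str, mut hits: Vec<RetrievedAsset>, top_k: usize) -> String {
    // Highest score first; `total_cmp` gives a total order over f32 values.
    hits.sort_by(|a, b| b.score.total_cmp(&a.score));
    hits.truncate(top_k);

    let mut prompt =
        String::from("Answer the question using only the context below.\n\nContext:\n");
    for (i, hit) in hits.iter().enumerate() {
        // Number each snippet so the model can refer back to its source asset.
        prompt.push_str(&format!("[{}] (asset {}) {}\n", i + 1, hit.asset_id, hit.snippet));
    }
    prompt.push_str(&format!("\nQuestion: {}\nAnswer:", question));
    prompt
}
```

The resulting string would then be sent to whichever LLM backend is configured (the PR mentions serving via Ollama); that call is left out here because its API is not described in this conversation.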

Additional Improvements

  • Enhanced support for LLM serving via Ollama
  • Qdrant client upgrade
    • Note: Upgrading the Qdrant server to 1.10+ is recommended for development

Breaking Changes

  • Asset metadata storage structure has been modified
  • All artifacts should be regenerated

zhuojg requested a review from web3nomad on Aug 22, 2024, 11:27
zhuojg added this to the Public Alpha 0.2 milestone on Aug 22, 2024
zhuojg (Contributor, Author) commented on Aug 22, 2024

Fixed #72.

Image files are now handled with the image crate instead of ffmpeg.

zhuojg (Contributor, Author) commented on Aug 22, 2024

Related to #59.

  • artifacts.json is more robust.
  • Thumbnail information is saved into artifacts.json.

zhuojg changed the title from "(WIP) Enhanced Content Management and AI Integration" to "Enhanced Content Management and AI Integration" on Sep 9, 2024
zhuojg merged commit 320a764 into main on Sep 9, 2024
zhuojg deleted the feat/content-base branch on Sep 9, 2024, 03:29