📄 Document Management API Challenge

Overview 🚀

In this challenge, you will build a backend API service to manage large PDF documents. The service must allow users to upload, search, and download PDF documents while efficiently handling resources, given a memory limitation of 50MB assigned to the document management service container. This challenge is designed for a mid-senior engineer to demonstrate advanced skills in Spring Boot, Java, REST API development, testing, containerization, and cloud storage integration.

Functional Requirements ✅

1. Upload Endpoint ⬆️

Functionality:
Allow uploading a PDF document along with the following metadata:
- User: A string identifying the user associated with the document.
- Document Name: The name provided in the request will be used as the file name.
- Tags: A list of tags associated with the document.
Technical Constraints:
- The service must handle PDF uploads of up to 500MB.
- The uploaded PDF should be stored in an S3-compatible bucket (simulated via MinIO) with the following directory structure:
```
document-bucket/
  ├─ user1/
  │  ├─ doc1.pdf
  │  ├─ doc2.pdf
  ├─ user2/
  │  ├─ doc3.pdf
```
- Metadata must be persisted in a PostgreSQL database with the following fields:
  - User
  - Document Name
  - Tags
  - MinIO Path
  - File Size
  - File Type
  - Created At
  - Include any additional fields you deem necessary
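
The directory layout above implies an object key of the form `<user>/<documentName>.pdf`. A minimal sketch of building that key is shown below. The class `ObjectKeys`, the method `forDocument`, and the rejection of path separators and `..` are illustrative assumptions, not part of the challenge contract.

```java
import java.util.Locale;
import java.util.Objects;

/** Hypothetical helper: builds the object key "<user>/<documentName>.pdf". */
final class ObjectKeys {

    private ObjectKeys() {}

    static String forDocument(String user, String documentName) {
        String safeUser = requireSegment(user, "user");
        String safeName = requireSegment(documentName, "documentName");
        // Append the extension only when the caller did not already supply it.
        if (!safeName.toLowerCase(Locale.ROOT).endsWith(".pdf")) {
            safeName = safeName + ".pdf";
        }
        return safeUser + "/" + safeName;
    }

    // Assumption: a segment must be non-blank and must not contain path
    // separators or "..", so one user cannot write under another user's prefix.
    private static String requireSegment(String value, String field) {
        Objects.requireNonNull(value, field + " must not be null");
        String trimmed = value.trim();
        if (trimmed.isEmpty() || trimmed.contains("/")
                || trimmed.contains("\\") || trimmed.contains("..")) {
            throw new IllegalArgumentException("Invalid " + field + ": " + value);
        }
        return trimmed;
    }

    public static void main(String[] args) {
        System.out.println(forDocument("user1", "doc1"));
    }
}
```

Storing the resulting key in the MinIO Path metadata field would keep the database row and the stored object in sync.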

📌 Storage Requirement: Uploading Documents to MinIO

All uploaded documents must be stored in MinIO to ensure scalability and efficient storage management. The service will interact with MinIO to handle file uploads and generate temporary access URLs for retrieval. For detailed instructions on how to set up and use MinIO locally, please refer to the following document: 📄 MinIO Local Setup Guide.

2. Search Endpoint 🔍

Functionality:
Allow querying documents with optional filters:
- Filters: User, Document Name, and Tags.
- If no filters are provided, return all documents.
- Results should be ordered by created_at in descending order.
- The endpoint must support pagination using page and size parameters.
Note:
This endpoint should not return any download URL.
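
The filtering, ordering, and pagination rules above can be illustrated with a small in-memory sketch. It only demonstrates the expected semantics; a real implementation would most likely push these operations down to PostgreSQL. The `Doc` class, zero-based pages, exact-match filters, and "must contain all requested tags" are assumptions; the OpenAPI specification is the authority on the actual contract.

```java
import java.time.Instant;
import java.util.Comparator;
import java.util.List;
import java.util.stream.Collectors;

final class SearchSketch {

    /** Hypothetical metadata row; field names are illustrative. */
    static final class Doc {
        final long id;
        final String user;
        final String name;
        final List<String> tags;
        final Instant createdAt;

        Doc(long id, String user, String name, List<String> tags, Instant createdAt) {
            this.id = id;
            this.user = user;
            this.name = name;
            this.tags = tags;
            this.createdAt = createdAt;
        }
    }

    /** A null (or empty) filter means "no filter"; pages are zero-based here. */
    static List<Doc> search(List<Doc> all, String user, String name,
                            List<String> tags, int page, int size) {
        if (page < 0 || size < 1) {
            throw new IllegalArgumentException("page must be >= 0 and size >= 1");
        }
        return all.stream()
                .filter(d -> user == null || d.user.equals(user))
                .filter(d -> name == null || d.name.equals(name))
                .filter(d -> tags == null || tags.isEmpty() || d.tags.containsAll(tags))
                // Newest first, matching "created_at in descending order".
                .sorted(Comparator.comparing((Doc d) -> d.createdAt).reversed())
                .skip((long) page * size)
                .limit(size)
                .collect(Collectors.toList());
    }

    public static void main(String[] args) {
        Instant now = Instant.parse("2024-01-01T00:00:00Z");
        List<Doc> docs = List.of(
                new Doc(1, "user1", "doc1", List.of("a"), now),
                new Doc(2, "user2", "doc3", List.of("b"), now.plusSeconds(60)));
        search(docs, null, null, null, 0, 10).forEach(d -> System.out.println(d.id));
    }
}
```

Note that the result type carries no download URL, in line with the note above.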

3. Download Endpoint ⬇️

Functionality:
Allow downloading a document using its ID. The endpoint should return a temporary download URL that enables secure access to the document stored in MinIO.
Implementation:
Generate a temporary download URL using MinIO’s pre-signed URL functionality. The service will utilize MinIO to generate a temporary download link based on the document's ID, allowing the document to be securely accessed without exposing direct storage paths.
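
To make the idea of a temporary URL concrete, here is a deliberately simplified sketch of an expiring signed link: the server signs the object path together with an expiry timestamp, and a link is only honoured while the signature matches and the expiry has not passed. This is not MinIO's actual signing scheme (MinIO follows the S3 request-signing protocol and produces the URL for you through its client SDK); the class `SignedUrlSketch`, the query parameter names, and the HMAC layout are all assumptions for illustration.

```java
import java.nio.charset.StandardCharsets;
import java.security.GeneralSecurityException;
import java.security.MessageDigest;
import javax.crypto.Mac;
import javax.crypto.spec.SecretKeySpec;

/** Conceptual illustration only; not the S3/MinIO signing algorithm. */
final class SignedUrlSketch {
    private final byte[] secret;

    SignedUrlSketch(String secret) {
        this.secret = secret.getBytes(StandardCharsets.UTF_8);
    }

    /** Returns "<path>?expires=<epochSeconds>&signature=<hex>". */
    String sign(String objectPath, long nowEpochSeconds, long ttlSeconds) {
        long expires = nowEpochSeconds + ttlSeconds;
        return objectPath + "?expires=" + expires
                + "&signature=" + hmacHex(objectPath + "\n" + expires);
    }

    /** A link is valid only before its expiry and with an untampered signature. */
    boolean isValid(String objectPath, long expires, String signature, long nowEpochSeconds) {
        if (nowEpochSeconds > expires) {
            return false;
        }
        byte[] expected = hmacHex(objectPath + "\n" + expires).getBytes(StandardCharsets.US_ASCII);
        // Constant-time comparison avoids leaking how many characters matched.
        return MessageDigest.isEqual(expected, signature.getBytes(StandardCharsets.US_ASCII));
    }

    private String hmacHex(String message) {
        try {
            Mac mac = Mac.getInstance("HmacSHA256");
            mac.init(new SecretKeySpec(secret, "HmacSHA256"));
            byte[] digest = mac.doFinal(message.getBytes(StandardCharsets.UTF_8));
            StringBuilder hex = new StringBuilder(digest.length * 2);
            for (byte b : digest) {
                hex.append(String.format("%02x", b));
            }
            return hex.toString();
        } catch (GeneralSecurityException e) {
            throw new IllegalStateException("HmacSHA256 unavailable", e);
        }
    }

    public static void main(String[] args) {
        SignedUrlSketch signer = new SignedUrlSketch("demo-secret");
        System.out.println(signer.sign("/document-bucket/user1/doc1.pdf", 1000L, 900L));
    }
}
```

Because the expiry is part of the signed message, a client cannot extend a link's lifetime by editing the `expires` parameter.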

Note:

For more details on how to use MinIO, refer to the documentation: 📄 MinIO Local Setup Guide.

Technical Requirements ⚙️

Memory Limitation:
The service memory is limited to 50MB. You must design your solution to efficiently manage memory during file upload and processing, even when handling uploads of files up to 500MB.
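
One common way to stay within such a limit is to stream the request body to storage through a small fixed-size buffer instead of loading the file into memory. The sketch below shows that pattern with plain `java.io` streams. The class name `StreamingCopy`, the 64 KiB buffer size, and the reading of 500MB as 500 MiB are assumptions; in the real service the source would be the request stream and the sink the storage client.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;
import java.io.OutputStream;

final class StreamingCopy {
    // Assumption: 64 KiB per upload keeps buffer memory small and predictable.
    static final int BUFFER_SIZE = 64 * 1024;
    // Assumption: "500MB" is read as 500 MiB.
    static final long MAX_BYTES = 500L * 1024 * 1024;

    private StreamingCopy() {}

    /** Copies in to out chunk by chunk; heap use stays at one buffer regardless of file size. */
    static long copy(InputStream in, OutputStream out, long maxBytes) throws IOException {
        byte[] buffer = new byte[BUFFER_SIZE];
        long total = 0;
        int read;
        while ((read = in.read(buffer)) != -1) {
            total += read;
            if (total > maxBytes) {
                // Stop as soon as the limit is crossed instead of reading the rest.
                throw new IOException("Upload exceeds " + maxBytes + " bytes");
            }
            out.write(buffer, 0, read);
        }
        return total;
    }

    public static void main(String[] args) throws IOException {
        byte[] sample = new byte[200_000];
        ByteArrayOutputStream sink = new ByteArrayOutputStream();
        System.out.println(copy(new ByteArrayInputStream(sample), sink, MAX_BYTES));
    }
}
```

The returned byte count can also populate the File Size metadata field without a second pass over the data.
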
Concurrent Uploads:
The system must be capable of handling up to 10 documents being uploaded in parallel, with each document having a size of up to 500MB.
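
A simple way to reason about this requirement is to cap the number of in-flight uploads and budget memory per upload. The sketch below uses a `Semaphore` for the cap and a helper for the arithmetic: with 10 uploads and an assumed 64 KiB buffer each, the copy buffers hold 10 × 65,536 = 655,360 bytes in total. The class `UploadLimiter` and the fail-fast behaviour for an 11th upload are assumptions; queueing the extra request would be an equally valid choice.

```java
import java.util.concurrent.Callable;
import java.util.concurrent.Semaphore;

/** Hypothetical guard that bounds how many uploads run at the same time. */
final class UploadLimiter {
    private final Semaphore permits;

    UploadLimiter(int maxConcurrentUploads) {
        this.permits = new Semaphore(maxConcurrentUploads);
    }

    /** Runs the upload if a slot is free; otherwise fails fast (an assumption). */
    <T> T run(Callable<T> upload) throws Exception {
        if (!permits.tryAcquire()) {
            throw new IllegalStateException("Too many concurrent uploads");
        }
        try {
            return upload.call();
        } finally {
            // Always free the slot, even when the upload throws.
            permits.release();
        }
    }

    int availableSlots() {
        return permits.availablePermits();
    }

    /** Worst-case bytes held in copy buffers across all concurrent uploads. */
    static long bufferBudgetBytes(int concurrentUploads, int bufferSizeBytes) {
        return (long) concurrentUploads * bufferSizeBytes;
    }

    public static void main(String[] args) throws Exception {
        UploadLimiter limiter = new UploadLimiter(10);
        System.out.println(limiter.<String>run(() -> "uploaded"));
        System.out.println(bufferBudgetBytes(10, 64 * 1024));
    }
}
```
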
Upload Time Limit:
There is no restriction on how long an upload may take. The only requirement is that the service handles uploads of up to 500MB without exceeding the memory limit.
Provided Artifacts:
- OpenAPI specification that includes the contract for the endpoints.
  - Reference: document-management-open-api.yml.
  - You can visualize the content using Swagger Editor.
- A docker-compose stack that includes PostgreSQL and the Document Management Service.
- Integrated tools:
  - Spring Boot: The project is pre-configured with Spring Boot.
  - Spring Data JPA: For database operations.
  - MinIO: For simulating bucket storage operations locally.
  - Lombok: For reducing boilerplate code.
  - JUnit 5: For unit and integration testing.
  - Mockito: For mocking dependencies in tests.
  - AssertJ: For fluent assertions in tests.
  - JaCoCo: For code coverage (run ./mvnw jacoco:report to generate the report).
  - Spotless: For code formatting (run ./mvnw spotless:apply to format your code).
Java Version:
The project is configured with Java 17, but you may restrict your solution to features available in Java 8 if necessary.
Schema Management:
Provide a script for creating the database schema, ensuring efficient handling of multiple tags per document.
Documentation:
(Optional) Include OpenAPI documentation for the API endpoints.

Implementation Instructions 🛠️

Use this repository as the starting point for your solution. If possible, create a fork of the repository.
Implement the endpoints as per the provided OpenAPI specification.
Configure a MinIO client.
Configure a connection to PostgreSQL.
Include your database schema script in docker/init-scripts/schema-init.sql.
Create the Dockerfile for the document-management-service.
Modify the docker-compose.yml file to add the necessary configuration for including the document-management-service in the stack. Ensure that the service correctly connects to PostgreSQL and MinIO.
Implement the required functionality for the Document Management Service.
Once your functionality is ready, validate it using Postman. Please note that you must start the stack using docker-compose up --build.
Commit your changes. It is recommended to maintain a clean commit history, ideally using Conventional Commits.
Push your changes to a personal GitHub account and share the URL of your solution.

⚠️ Note: All configurations (database credentials, MinIO/S3 settings, etc.) must be externalized using environment variables and configuration files. Avoid hardcoding sensitive information in the source code.
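
As a minimal sketch of externalized configuration, the class below reads storage settings from a map of environment variables and fails at startup when a required value is missing. The variable names (`MINIO_ENDPOINT`, `MINIO_ACCESS_KEY`, `MINIO_SECRET_KEY`, `MINIO_BUCKET`) and the class `StorageSettings` are hypothetical; in a Spring Boot project the same effect is usually achieved with property placeholders that resolve from the environment.

```java
import java.util.HashMap;
import java.util.Map;

/** Hypothetical settings holder; the variable names below are assumptions. */
final class StorageSettings {
    final String endpoint;
    final String accessKey;
    final String secretKey;
    final String bucket;

    private StorageSettings(String endpoint, String accessKey, String secretKey, String bucket) {
        this.endpoint = endpoint;
        this.accessKey = accessKey;
        this.secretKey = secretKey;
        this.bucket = bucket;
    }

    /** Taking a Map keeps this testable; the service would pass System.getenv(). */
    static StorageSettings fromEnvironment(Map<String, String> env) {
        return new StorageSettings(
                required(env, "MINIO_ENDPOINT"),
                required(env, "MINIO_ACCESS_KEY"),
                required(env, "MINIO_SECRET_KEY"),
                // Only the non-sensitive bucket name gets a default.
                env.getOrDefault("MINIO_BUCKET", "document-bucket"));
    }

    private static String required(Map<String, String> env, String name) {
        String value = env.get(name);
        if (value == null || value.trim().isEmpty()) {
            throw new IllegalStateException("Missing required environment variable: " + name);
        }
        return value;
    }

    public static void main(String[] args) {
        Map<String, String> sample = new HashMap<>();
        sample.put("MINIO_ENDPOINT", "http://localhost:9000");
        sample.put("MINIO_ACCESS_KEY", "example-access-key");
        sample.put("MINIO_SECRET_KEY", "example-secret-key");
        System.out.println(fromEnvironment(sample).bucket);
    }
}
```

Failing fast on a missing credential surfaces configuration mistakes when the container starts rather than on the first upload.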

Evaluation Criteria 🏆

Database Schema and Indexing:
Evaluate the efficiency of your database schema, including the creation of indices and the management of multiple tags per document.
Design Patterns and Best Practices:
Assess the use of design patterns (e.g., Controller-Service-Repository or Hexagonal Architecture) and adherence to SOLID principles and clean code practices.
Code Quality:
Review for readability, maintainability, proper exception handling, and overall coding standards.
Testing:
Evaluate the quality and coverage of unit and integration tests. While no specific coverage percentage is required, tests should cover the most critical functionalities and edge cases.
Spring Boot and Java Proficiency:
Demonstrate effective use of Spring Boot features and Java (preferably Java 17, though Java 8+ is acceptable).
Additional Considerations:
- Overall robustness and efficiency under concurrent file uploads.
- Validations on models and DTOs (e.g., non-null constraints).
- (Optional) OpenAPI documentation.

Challenge Priorities 🎯

Upload Service:
- Primary focus on implementing a robust upload endpoint that efficiently handles large file uploads (up to 500MB) within the 50MB memory constraint.
Search Service:
- Implement a flexible and efficient search endpoint with filtering, sorting, and pagination.
Download Service:
- Provide document download functionality via temporary pre-signed URLs generated by MinIO.

Note: It is acceptable to implement a subset of the endpoints. However, the more complete your solution, the better.

Submission Instructions 📤

Ensure that your solution includes the Dockerfile and database schema script, and that it adheres to the challenge requirements.

Additional Comments 💬

If you have any additional notes, explanations, or assumptions regarding your implementation, feel free to include them in this section. This can help provide more context to reviewers.

⚠️ Important Note About the Challenge Completion ⚠️

Even if you are unable to complete the challenge 100%, please explain why you couldn't proceed, what doubts you had, and any blockers you encountered. We will review each case individually to determine how it impacts the evaluation.

Note: Your approach, problem-solving skills, and reasoning are just as important as the final implementation.
