Hunting is data profiling exposed as REST API. It is built on top of Pandas Profiling & Polars.

Hunting

The easiest way to explore or contribute to Hunting is to open the project on Gitpod or on GitHub Codespaces:

Open in Gitpod

Pre-requisites

  • Currently the setup is tested for development only on macOS and Linux
  • Install and run Docker and Docker Compose

Starting the application

  • Execute the following docker-compose command to start the Hunting application and its dependent services

      docker-compose up
    
  • When the application is started using docker-compose, a directory named volumes is created in the current directory. The Docker data for all the services is persisted there.

Access the application

Once the application is up and running you should be able to access it using the following urls:

Service            URL
Server API Root    http://0.0.0.0:8000/api/v1
Swagger            http://0.0.0.0:8000/api/docs
Redoc              http://0.0.0.0:8000/redoc
MongoDB            http://localhost:27017 (Username: root, Password: example)
Redis              http://localhost:6379 (Password: password)
Flower Dashboard   http://localhost:5555
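To confirm that the HTTP services came up, a small smoke check can request each URL. The following is a minimal sketch rather than part of the project: the `probe` and `check_all` helpers are hypothetical names, and only the HTTP endpoints listed above are included. MongoDB and Redis do not speak HTTP, so they are left out.

```python
import urllib.error
import urllib.request

# URLs copied from the list above; only the HTTP services are included.
SERVICES = {
    "Swagger": "http://0.0.0.0:8000/api/docs",
    "Redoc": "http://0.0.0.0:8000/redoc",
    "Flower Dashboard": "http://localhost:5555",
}


def probe(url, opener=urllib.request.urlopen, timeout=3):
    """Return the HTTP status code, or None if the service is unreachable."""
    try:
        with opener(url, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as err:
        # The server answered, just not with a 2xx status.
        return err.code
    except (urllib.error.URLError, OSError):
        # Connection refused, timeout, DNS failure, and similar.
        return None


def check_all(services=SERVICES, opener=urllib.request.urlopen):
    """Map each service name to its status code (None means unreachable)."""
    return {name: probe(url, opener=opener) for name, url in services.items()}
```

Calling `check_all()` after `docker-compose up` returns a dictionary of status codes. A `None` value suggests the corresponding container is not up yet or its port is not mapped.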

Stopping the application

  • Execute the following docker-compose command to stop Hunting and all its components

      docker-compose stop
    

Or use the following command to stop the application, and remove all the containers and networks that were created:

  docker-compose down

Environment variables

  • Create a .env file in the root directory based on .env.example.
    • The values in .env.example are pre-configured to run the application using the default docker-compose.yml
  • If no .env file is found in the root directory, the default values provided in /app/core/config.py are used for the environment variables.
    • The values in /app/core/config.py are pre-configured to run the application using the default docker-compose.yml
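The fallback described above can be illustrated with a small loader. This is a sketch under assumptions, not the code in /app/core/config.py: the `DEFAULTS` value, the helper names, and the precedence order (real environment variables over .env values over defaults) are all illustrative.

```python
import os
from pathlib import Path

# Hypothetical default for illustration; the real defaults live in /app/core/config.py.
DEFAULTS = {"ENABLE_PREFETCH": "false"}


def parse_env_file(path):
    """Parse KEY=VALUE lines, skipping blanks and comments; {} if the file is absent."""
    file = Path(path)
    if not file.is_file():
        return {}
    values = {}
    for line in file.read_text().splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def load_settings(env_path=".env", defaults=DEFAULTS, environ=None):
    """Merge defaults, then .env values, then real environment variables (assumed order)."""
    environ = os.environ if environ is None else environ
    settings = dict(defaults)
    settings.update(parse_env_file(env_path))
    settings.update({k: environ[k] for k in settings if k in environ})
    return settings
```

With no .env file present, `load_settings()` returns the defaults, with any key that is also set in the process environment overridden by that value.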

Prefetch

  • The Prefetch group of routes is enabled only if ENABLE_PREFETCH is true
  • Flower Dashboard
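One way to picture the ENABLE_PREFETCH switch is as a boolean flag that decides whether a route group is registered at startup. The sketch below is an assumption about the general pattern, not the project's code: the accepted truthy spellings, the default of false, and the `profile` group name are hypothetical.

```python
import os

# Assumed spellings that count as "true"; the project may accept a different set.
TRUTHY = {"1", "true", "yes", "on"}


def prefetch_enabled(environ=None):
    """Read ENABLE_PREFETCH as a boolean, defaulting to False when unset."""
    environ = os.environ if environ is None else environ
    return environ.get("ENABLE_PREFETCH", "false").strip().lower() in TRUTHY


def enabled_route_groups(environ=None):
    """List the route groups to register; "profile" is a hypothetical always-on group."""
    groups = ["profile"]
    if prefetch_enabled(environ):
        groups.append("prefetch")
    return groups
```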

S3

  • Limitation: only one set of AWS credentials is supported, and it must have access to all the S3 buckets.
  • S3 URLs are expected in the following pattern: s3://bucket_name/path/to/file/file_name.csv
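The URL pattern above splits into a bucket name and an object key. The helper below is a minimal sketch using only the standard library. `split_s3_url` is a hypothetical name, and this is not necessarily how Hunting parses the URL internally.

```python
from urllib.parse import urlparse


def split_s3_url(url):
    """Split s3://bucket_name/path/to/file.csv into (bucket, key)."""
    parsed = urlparse(url)
    bucket = parsed.netloc
    key = parsed.path.lstrip("/")
    if parsed.scheme != "s3" or not bucket or not key:
        raise ValueError(f"expected s3://bucket_name/path/to/file, got {url!r}")
    return bucket, key
```

The resulting pair corresponds to the `Bucket` and `Key` arguments that S3 clients such as boto3 take, for example in `get_object(Bucket=..., Key=...)`.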

Background Task

TODO
