[EPIC] Data Simulation #277

Open
DMalone87 opened this issue May 12, 2023 · 5 comments
Labels
epic Large issues and features that involve multiple PR's or sub-issues

Comments

@DMalone87
Collaborator

Is your feature request related to a problem? Please describe.
In order to propel the project forward, we need to be able to demonstrate the full functionality of the application once we have reached our finished state. To help us do this, we should create a function that will simulate data being entered into the index from a variety of sources.

Describe the solution you'd like
Create a function that simulates a large set of data being entered into the index. When run, the function should create a set of entries in the database as though they had been uploaded via an API push. Each dataset should include a simulated source and at least 1000 separate incidents.

Requirements:

  • The function should allow the caller to choose which zip code the incidents occurred in
  • The function should be able to spread the events across a selected time range
  • The function should generate an address for each entry that will properly be displayed on the map
  • The generated entries should come with random protected record attachments.
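One way the requirements above could be sketched, using only the Python standard library. Everything named here is an assumption for illustration: the function name `simulate_incidents`, the field names, the street list, and the attachment types are hypothetical and are not taken from the project's Incident Model. The addresses are random strings, so this sketch does not guarantee they geocode to a point inside the chosen zip code; a real version would need an address source that does.

```python
import random
from datetime import datetime, timedelta

# Hypothetical values for illustration only; not taken from the Incident Model.
STREETS = ["Main St", "Oak Ave", "Maple Dr", "Cedar Ln"]
ATTACHMENT_TYPES = ["report", "video", "photo", "legal_filing"]


def simulate_incidents(zip_code, start, end, count=1000,
                       source="Simulated Source", seed=None):
    """Return `count` fake incident dicts in `zip_code`, spread over [start, end]."""
    if end < start:
        raise ValueError("end must not be earlier than start")
    rng = random.Random(seed)  # a seed makes a demo dataset reproducible
    span_seconds = int((end - start).total_seconds())
    incidents = []
    for i in range(count):
        # Pick a uniformly random moment inside the requested time range.
        occurred = start + timedelta(seconds=rng.randint(0, span_seconds))
        # Attach zero to two distinct attachment types, all marked protected.
        attachments = rng.sample(ATTACHMENT_TYPES, k=rng.randint(0, 2))
        incidents.append({
            "source": source,
            "source_id": f"sim-{i:05d}",
            "time_of_incident": occurred.isoformat(),
            # Random street address; NOT guaranteed to geocode correctly.
            "address": f"{rng.randint(1, 9999)} {rng.choice(STREETS)}",
            "zip_code": zip_code,
            "attachments": [{"type": t, "protected": True} for t in attachments],
        })
    return incidents
```

A call such as `simulate_incidents("60601", datetime(2023, 1, 1), datetime(2023, 12, 31), seed=1)` would return 1000 dictionaries that could then be pushed through whatever upload path the API exposes.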

Describe alternatives you've considered
This feature is meant to help us demonstrate the functionality of the app and test advanced features until we have live data imports from partners.

Additional context
The records from Citizens Police Data Project can serve as a model for the simulated entries.

@DMalone87
Collaborator Author

@ren0nie0, want to give it a shot?

@ren0nie0
Contributor

ren0nie0 commented May 13, 2023 via email

@ren0nie0
Contributor

I found this library to generate random addresses: random-address 1.1.1

@DMalone87
Collaborator Author

This the one? https://pypi.org/project/random-address/

Definitely looks like it'll serve well here.

To your other question, I was referring to these icons:
[Image: Screenshot 2023-05-16 at 7 04 04 PM]
You can see them on the lower right side. Those are meant to represent an incident record that has some additional data attached, as referenced in the Incident Model.

@DMalone87
Collaborator Author

So, I did a little digging. At the moment, we've got three major things that need to be completed before we can do this properly.

  1. Currently there's an API call that will let you enter a single incident record, but not a set. We'll need to add a batch entry API call.
  2. There are some major issues with the current incident model that's being used in the software. Key things are missing from the Incident Model we defined at the beginning of the process that we should include. There are also a few misspellings I noticed.
  3. Obviously, we'll need to create the randomizer function. I think it makes sense to include this as an option for the batch entry API command.

I'll make separate issues for each of these, and we can consider this issue completed when those three components are done.
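As a rough illustration of how items 1 and 3 might fit together, here is a minimal, framework-free sketch of a batch handler that accepts a list of incidents and, optionally, a request to simulate more. The name `process_batch`, the `incidents` and `simulate` payload keys, and `REQUIRED_FIELDS` are all hypothetical; the actual endpoint and the corrected Incident Model would determine the real shape.

```python
# Hypothetical required fields; the real list would come from the Incident Model.
REQUIRED_FIELDS = ("source", "time_of_incident", "address")


def process_batch(payload, generator=None):
    """Validate a batch payload and return (accepted, errors).

    `payload["incidents"]` is a list of incident dicts. If `payload["simulate"]`
    is present, its contents are passed as keyword arguments to `generator`,
    and the generated records are appended to the batch.
    """
    records = list(payload.get("incidents", []))
    options = payload.get("simulate")
    if options is not None:
        if generator is None:
            raise ValueError("simulation requested but no generator supplied")
        records.extend(generator(**options))

    accepted, errors = [], []
    for index, record in enumerate(records):
        # Report every missing or empty required field for this record.
        missing = [f for f in REQUIRED_FIELDS if not record.get(f)]
        if missing:
            errors.append({"index": index, "missing": missing})
        else:
            accepted.append(record)
    return accepted, errors
```

Passing the randomizer in as `generator` keeps the batch logic testable on its own and leaves the randomizer free to change independently.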

@DMalone87 DMalone87 added the epic Large issues and features that involve multiple PR's or sub-issues label Aug 16, 2023
@DMalone87 DMalone87 changed the title from "[FEATURE] Data Simulation" to "[EPIC] Data Simulation" Aug 16, 2023

2 participants