Skip to content
This repository has been archived by the owner on Jun 25, 2023. It is now read-only.

martialmarel/migration-data-mongo

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

1 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

MongoDB Migration Node Script

Exercice

You work at a Bitcoin exchange company. For some reason, the customer data is missing its full address. You have customer data only with the street address (building number and street name) but no city, state, country. Also, phone numbers are missing too. Luckily, your friends were able to restore the address information from a backup replica of a MongoDB instance. You need to write a migration/restoration script which will merge the data from the two sources.

You have millions of records so you need to create a script which can run queries to the database in parallel. You don't know what is the optimal number of customers to insert into a database at a time so you need to write the program to allow for a variable number of documents to be able to be updated at once. This will help to determine if it's better to update in groups of 10, 50 or maybe 500 at a time. Assume that the order of the objects in each file correlates to objects in the other file.

Submission

The script merges the data from two JSON files. The data is then inserted to Mongo in parallel. For parrallelism using library https://caolan.github.io/async/

To launch migration script, use command-line. The migration script require one paramater : its the number of insertions through each parrellel tasks:

node migrate-data [number of objects per function] by example node migrate-data 50

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published