phm2021_data_challenge

WORK IN PROGRESS - much of the code in /package is getting refactored

database implementation for the N-CMAPSS dataset that uses the same schema as the uav project (with some improvements and minor differences) with a python api.

Check out jupyter notebook for some api examples of how to use the api for data insertion and extraction.

Steps

clone the repository git clone https://github.com/darrahts/phm2021_data_challenge.git
make setup.sh executable cd phm2021_data_challenge && chmod +x setup.sh
get the raw data from the Prognostics Center of Excellence (NASA) here, unzip it, and remove “N-CMAPSS_DS02-006” and “N-CMAPSS_DS08d-010”
put the .h5 data files in the /data_h5 directory (.h5 files are ignored in gitignore)
execute setup.sh to install the database, configure the user, set up the table schema, and populate the database (user prompts y/n for different steps)
./setup.sh
[optional] create a conda environment
conda env create --file environment.yml (default name is tfgpu)
conda activate tfgpu
start jupyter lab (or notebook)
jupyter lab
open database_api_ncmapss.ipynb and if in step 5 you selected the option to populate the database, skip steps 1-9

NOTES

if you are using aws secrets you want

boto3  
base64  
oath2client  
oathlib  
openssl

and in step 4 of database_api_ncmapss.ipynb you will change
params = utils.get_aws_secret("/secret/ncmapssdb") to match your secret name

otherwise,

params = {'datasource.username': $USER, # the username of the logged in user
            'datasource.password': <password entered in step 4>, 
            'datasource.database': 'ncmapss_db', # <- NO CHANGE 
            'datasource.url': 'localhost', # <- or your database installation location
            'datasource.port': '5432'} # <- most likely don't change

besides the typical pandas/numpy stack and other packages that are probably already on your system, you will need

psycopg2
h5py

If you created a conda environment from the environment.yml, you have all of all of the required dependencies

Name		Name	Last commit message	Last commit date
Latest commit History 143 Commits
data_h5		data_h5
logs		logs
models		models
notebooks		notebooks
package		package
sql		sql
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
environment.yml		environment.yml
environment_new.yml		environment_new.yml
install_timescale.sh		install_timescale.sh
paths.csv		paths.csv
setup.sh		setup.sh

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

phm2021_data_challenge

Steps

NOTES

About

Releases

Packages

Contributors 2

Languages

License

darrahts/phm2021_data_challenge

Folders and files

Latest commit

History

Repository files navigation

phm2021_data_challenge

Steps

NOTES

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages