LISTEN : LInux Server moniTor by rEcordiNg running status

Overview

LISTEN is a Python script that automatically monitors the running status of a group of Linux servers. It currently records the following metrics:

CPU utilization
RAM usage
Disk storage usage
GPU utilization (only when nvidia-smi is available)
GPU memory (only when nvidia-smi is available)
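
As an illustration of how the GPU metrics above could be collected, here is a minimal sketch that parses the CSV output of a real `nvidia-smi` query. The function name `parse_gpu_stats` and the dictionary keys are assumptions made for this example and are not taken from LISTEN's source code.

```python
from typing import Dict, List

# A real nvidia-smi query that prints one CSV line per GPU, without header or units.
# How LISTEN itself invokes nvidia-smi is not shown in this README; this is an assumption.
NVIDIA_SMI_QUERY = (
    "nvidia-smi --query-gpu=utilization.gpu,memory.used,memory.total "
    "--format=csv,noheader,nounits"
)


def parse_gpu_stats(output: str) -> List[Dict[str, float]]:
    """Parse the query output into one dictionary per GPU (hypothetical helper)."""
    stats = []
    for line in output.strip().splitlines():
        if not line.strip():
            continue
        # Each line looks like "35, 1024, 11264": utilization %, used MiB, total MiB.
        util, mem_used, mem_total = (float(field) for field in line.split(","))
        stats.append({
            "gpu_util_percent": util,
            "gpu_mem_used_mib": mem_used,
            "gpu_mem_total_mib": mem_total,
        })
    return stats


if __name__ == "__main__":
    sample = "35, 1024, 11264\n0, 3, 11264\n"
    for index, gpu in enumerate(parse_gpu_stats(sample)):
        print(index, gpu)
```

Note that this sketch raises ValueError if a field is reported as not available (for example on GPUs that do not expose utilization), so a real monitor would need to handle that case.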

LISTEN has several key features:

Dynamic: while monitoring is running, you can add both servers to be monitored and data to be recorded by editing the configuration files.
Autonomy: in addition to the built-in metrics (e.g., CPU utilization), you can define any custom data you want to record.
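
One common way to support editing a configuration file while a monitor is running is to re-read the file whenever its modification time changes. The sketch below shows that technique only; the class name `ReloadingConfig` is hypothetical, and LISTEN's actual reload mechanism may work differently.

```python
import json
import os
from typing import Any, Dict, Optional


class ReloadingConfig:
    """Re-read a JSON file whenever its modification time changes (hypothetical helper)."""

    def __init__(self, path: str) -> None:
        self.path = path
        self._mtime_ns: Optional[int] = None  # mtime of the version currently cached
        self._data: Dict[str, Any] = {}

    def get(self) -> Dict[str, Any]:
        """Return the current configuration, reloading it if the file changed."""
        mtime_ns = os.stat(self.path).st_mtime_ns
        if mtime_ns != self._mtime_ns:
            with open(self.path, "r", encoding="utf-8") as f:
                self._data = json.load(f)
            self._mtime_ns = mtime_ns
        return self._data
```

A monitoring loop could call `get()` at the start of every polling round, so that newly added servers are picked up on the next round without a restart.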

How to monitor

Install dependencies:

pip install -r requirements.txt

Configure the Linux servers to be monitored in ./config/monitor_config.json. Here is an example:

{
    "192.168.1.1":{
        "user": "panda"
    },
    "192.168.2.1":{
        "user": "cat",
        "port": 3456
    }
}

If 'port' is not specified, LISTEN uses the default SSH port (22).
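
The default-port rule can be sketched in a few lines of Python. The helper `normalize_servers` and the constant `DEFAULT_SSH_PORT` are names invented for this example, and treating 'user' as a required key is an assumption; LISTEN's own loading code may differ.

```python
import json
from typing import Any, Dict

DEFAULT_SSH_PORT = 22  # the default stated in this README


def normalize_servers(raw: Dict[str, Dict[str, Any]]) -> Dict[str, Dict[str, Any]]:
    """Fill in the default port for every server entry (hypothetical helper)."""
    servers = {}
    for host, options in raw.items():
        if "user" not in options:
            # Assumption: every entry must name the login user.
            raise ValueError(f"server {host!r} has no 'user' entry")
        servers[host] = {
            "user": options["user"],
            "port": int(options.get("port", DEFAULT_SSH_PORT)),
        }
    return servers


# The example configuration from this README, inlined as a string for the sketch.
EXAMPLE_CONFIG = json.loads("""
{
    "192.168.1.1": {"user": "panda"},
    "192.168.2.1": {"user": "cat", "port": 3456}
}
""")

if __name__ == "__main__":
    for host, options in normalize_servers(EXAMPLE_CONFIG).items():
        print(host, options["user"], options["port"])
```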

Configure SSH key-based login for each remote Linux server:

ssh-copy-id <user>@<server-ip>

Note that every server you want to monitor must be configured for password-free login.
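
Before starting the monitor, it can help to confirm that password-free login really works for each server. The sketch below uses the standard OpenSSH options BatchMode and ConnectTimeout so that ssh fails instead of prompting for a password. The function names are hypothetical and are not part of LISTEN.

```python
import subprocess
from typing import List


def build_ssh_check_command(user: str, host: str, port: int = 22) -> List[str]:
    """Build an ssh command that runs `true` remotely and never prompts for a password."""
    return [
        "ssh",
        "-o", "BatchMode=yes",      # disable password prompts; fail instead
        "-o", "ConnectTimeout=5",   # give up after 5 seconds if the host is unreachable
        "-p", str(port),
        f"{user}@{host}",
        "true",
    ]


def can_login_without_password(user: str, host: str, port: int = 22) -> bool:
    """Return True if key-based login succeeds (requires the ssh client to be installed)."""
    result = subprocess.run(
        build_ssh_check_command(user, host, port),
        stdout=subprocess.DEVNULL,
        stderr=subprocess.DEVNULL,
    )
    return result.returncode == 0
```

For the example configuration, `can_login_without_password("cat", "192.168.2.1", 3456)` would return False until `ssh-copy-id` has been run for that server.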

Then start the monitor in the background:

nohup python monitor.py >> ./log/running.log 2>&1 &

Finally, all monitoring data is automatically recorded to TensorBoard, which can be launched with:

tensorboard --logdir ./log --port 5555

and check them in the browser:

localhost:5555

Here is an example of successful monitoring:

More features to explore:

LISTEN supports different levels of log output, which can be enabled by passing '--verbose' when running the Python script. See ./utils/argsparser.py for more information.
