Implemented a performance data graphing module for batch mode #272

rayyang29 · 2022-06-13T19:10:12Z

Description of what I changed

I implemented a module that monitors the resource usage of batch mode when HAPI is the source. The module generates graphs and CSV files of the resource usage (CPU, memory, and I/O) of the HAPI server, the PostgreSQL database, and the pipeline over the duration of the batch job. The user can specify the number of processes/cores used in the pipeline to assess batch mode's performance on the local machine.

The following files have been added in this PR:
/utils/resource-monitor/graph_pidstat.py - The driver program that generates graphs and tables of resource usage.
/utils/resource-monitor/monitor_pipeline.sh - A bash script that starts the monitoring processes for the server, database, and pipeline; it is called by the driver program.
/utils/resource-monitor/auto.sh - A bash script that automates the generation of resource usage graphs and CSV files for different numbers of cores used in the batch job.
/utils/resource-monitor/README.md - The README file for the resource-monitor module.
/docker/hapi-compose.yml - A Docker Compose .yml file for the latest version of the HAPI server with a PostgreSQL database.
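
To illustrate the kind of processing the driver program performs, here is a minimal sketch of turning pidstat-style text into CSV rows. It is not the code in graph_pidstat.py: the function names `parse_pidstat` and `to_csv`, the sample text, and the assumed column layout (a header line containing `UID` and `%CPU`) are all hypothetical and chosen only for illustration.

```python
import csv
import io

# Hypothetical sample in the general shape of `pidstat -u` output; real output
# varies with the sysstat version and locale (e.g. 24-hour time has no AM/PM).
SAMPLE = """\
Linux 5.4.0 (example-host)  06/13/2022  _x86_64_  (8 CPU)

07:10:12 PM   UID       PID    %usr %system  %guest   %wait    %CPU   CPU  Command
07:10:13 PM  1000      1234   12.00    3.00    0.00    0.00   15.00     2  java
07:10:14 PM  1000      1234   20.00    5.00    0.00    0.00   25.00     3  java
"""


def parse_pidstat(text):
    """Return one dict per sample line, keyed by the header's column names."""
    rows, columns, offset = [], None, 0
    for line in text.splitlines():
        tokens = line.split()
        if "UID" in tokens and "%CPU" in tokens:
            # Header line: everything before "UID" is the timestamp.
            offset = tokens.index("UID")
            columns = tokens[offset:]
            continue
        if columns is None or len(tokens) != offset + len(columns):
            continue  # banner, blank line, or a line that does not fit the header
        if tokens[0].startswith("Average"):
            continue  # summary line, not a sample
        row = dict(zip(columns, tokens[offset:]))
        row["time"] = " ".join(tokens[:offset])
        rows.append(row)
    return rows


def to_csv(rows):
    """Serialize parsed rows to CSV text, using the first row's keys as header."""
    if not rows:
        return ""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=list(rows[0].keys()))
    writer.writeheader()
    writer.writerows(rows)
    return buf.getvalue()


if __name__ == "__main__":
    print(to_csv(parse_pidstat(SAMPLE)))
```

Reading the column names from the header rather than hard-coding positions keeps the sketch tolerant of pidstat versions that add or drop columns; command names containing spaces are not handled.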

Related to issue #266

E2E test

TESTED:

Checklist: I completed these to help reviewers :)

My IDE is configured to follow the code style of this project.

No? Unsure? -> configure your IDE, format the code and add the changes with git add . && git commit --amend
I am familiar with Google Style Guides for the language I have coded in.

No? Please take some time and review Java and Python style guides. Note, when in conflict, OpenMRS style guide overrules.
I have added tests to cover my changes. (If you refactored existing code that was well tested you do not have to add tests)

No? -> write tests and add them to this commit git add . && git commit --amend
I ran mvn clean package right before creating this pull request and added all formatting changes to my commit.
All new and existing tests passed.

No? -> figure out why and add the fix to your commit. It is your responsibility to make sure your code works.
My pull request is based on the latest changes of the master branch.

No? Unsure? -> execute command git pull --rebase upstream master

bashir2

Thanks @rayyang29 for this; we should also add the graphs, with some notes, to a separate doc/wiki page once you have them all together.

docker/hapi-compose.yml

pipelines/batch/src/main/java/org/openmrs/analytics/FhirEtl.java

utils/resource-monitor/graph_pidstat.py

rayyang29

Thanks @bashir2 for the code review! I answered the questions, made the style changes, and ran the black formatter on the Python script. Please have a look at the formatting it produced, though. I will update the PR with the changes.

docker/hapi-compose.yml

utils/resource-monitor/auto.sh

utils/resource-monitor/graph_pidstat.py

bashir2

Thanks Ray for the changes; I think all of the remaining comments are minor/doc/questions.

utils/resource-monitor/README.md

utils/resource-monitor/auto.sh

utils/resource-monitor/graph_pidstat.py

docker/hapi-compose.yml

utils/resource-monitor/README.md

rayyang29

Thanks @bashir2 for your additional comments. I've made some minor changes and addressed all unresolved conversations.

docker/hapi-compose.yml

utils/resource-monitor/README.md

utils/resource-monitor/graph_pidstat.py

rayyang29 requested a review from bashir2 June 13, 2022 19:10

bashir2 reviewed Jun 22, 2022

rayyang29 commented Jun 22, 2022

rayyang29 requested a review from bashir2 June 23, 2022 15:05

bashir2 approved these changes Jun 28, 2022

rayyang29 commented Jul 5, 2022

Ray Yang and others added 5 commits July 5, 2022 20:29

Implemented a performance data graphing module for batch mode

70ef352

Created README file for resource-monitor module

ce942ba

Refactored performance data graphing module

6544547

Implemented minor changes

402d5bc

Merge branch 'master' into master

14c0b0b

bashir2 merged commit 80fb4de into google:master Jul 11, 2022
