GPOEO is a micro-intrusive GPU online energy optimization framework for iterative applications

Last update: Sep 10, 2022

Related tags

Overview

GPOEO

GPOEO is a micro-intrusive GPU online energy optimization framework for iterative applications. We also implement ODPP [1] as a comparison.

[1] P. Zou, L. Ang, K. Barker, and R. Ge, “Indicator-directed dynamic power management for iterative workloads on gpu-accelerated systems,” in 2020 20th IEEE/ACM International Symposium on Cluster, Cloud and Internet Computing (CCGRID). IEEE, 2020, pp. 559-568.

./EPOpt contains source code of the GPOEO and ODPP [1].
./PerformanceMeasurement (PerfMeasure) is a NVIDIA GPU measurer for energy/power/utilities/clocks

Make GPOEO

Modify pathes of headers and libraries in ./EPOpt/makefile . cd ./EPOpt && mkdir ./build && cp makefile ./build cd ./build && make

Make PerfMeasure

Modify pathes of headers and libraries in ./PerformanceMeasurement/makefile . cd ./PerformanceMeasurement && mkdir ./build && cp makefile ./build cd ./build && make

Use GPOEO in python applications

GPOEO only has two APIs:

Begin(GPUID4CUDA, GPUID4NVML, RunMode, MeasureOutDir, ModelDir, TestPrefix)
End()

GPUID4CUDA: GPU ID used in CUDA environment.

GPUID4NVML: GPU ID queried with nvidia-smi and used to initialize CUPTI.

RunMode: "WORK" (run energy saving online); "MEASURE" (measure hardware performance counter metrics and other data for training multi-objective prediction models).

MeasureOutDir: measurement output file path.

ModelDir: the path of multi-objective prediction models.

TestPrefix: prefix name of one run.

The two APIs should be inserted at the beginning and end of the main python file respectively. As shown below:

from PyEPOpt import EPOpt

if __name__=="__main__":
    EPOpt.Begin(GPUID4CUDA, GPUID4NVML, RunMode, MeasureOutDir, ModelDir, TestPrefix)

    .....

    EPOpt.End()

Use ODPP [1] in python applications

ODPP can be implemented as a daemon. However, for the convenience of comparing GPOEO and ODPP, we also implement ODPP into the same form: two APIs.

ODPPBegin(GPUID4CUDA, GPUID4NVML, RunMode, MeasureOutDir, ModelDir, TestPrefix)
ODPPEnd()

GPUID4CUDA: GPU ID used in CUDA environment.

GPUID4NVML: GPU ID queried with nvidia-smi and used to initialize CUPTI.

RunMode: "ODPP" (run ODPP online).

MeasureOutDir: not used.

ModelDir: the path of ODPP models.

TestPrefix: prefix name of one run.

The two APIs should be inserted at the beginning and end of the main python file respectively. As shown below:

from ODPP import ODPPBegin, ODPPEnd

if __name__=="__main__":
    ODPPBegin(GPUID4CUDA, GPUID4NVML, RunMode, MeasureOutDir, ModelDir, TestPrefix)

    .....

    ODPPEnd()

GPOEO is a micro-intrusive GPU online energy optimization framework for iterative applications

Related tags

Overview

GPOEO

Make GPOEO

Make PerfMeasure

Use GPOEO in python applications

Use ODPP [1] in python applications

Owner

瑞雪轻飏

Causal estimators for use with WhyNot

Research code for CVPR 2021 paper "End-to-End Human Pose and Mesh Reconstruction with Transformers"

[CVPR 2022] PoseTriplet: Co-evolving 3D Human Pose Estimation, Imitation, and Hallucination under Self-supervision (Oral)

Unofficial implementation of Proxy Anchor Loss for Deep Metric Learning

SporeAgent: Reinforced Scene-level Plausibility for Object Pose Refinement

GuideDog is an AI/ML-based mobile app designed to assist the lives of the visually impaired, 100% voice-controlled

一套完整的微博舆情分析流程代码，包括微博爬虫、LDA主题分析和情感分析。

Deep learning algorithms for muon momentum estimation in the CMS Trigger System

[ICLR 2021 Spotlight Oral] "Undistillable: Making A Nasty Teacher That CANNOT teach students", Haoyu Ma, Tianlong Chen, Ting-Kuei Hu, Chenyu You, Xiaohui Xie, Zhangyang Wang

Weakly Supervised Text-to-SQL Parsing through Question Decomposition

Swin-Transformer is basically a hierarchical Transformer whose representation is computed with shifted windows.

A hue shift helper for OBS

A scanpy extension to analyse single-cell TCR and BCR data.

Lab Materials for MIT 6.S191: Introduction to Deep Learning

Our CIKM21 Paper "Incorporating Query Reformulating Behavior into Web Search Evaluation"

Leaderboard, taxonomy, and curated list of few-shot object detection papers.

天勤量化开发包, 期货量化, 实时行情/历史数据/实盘交易

Implementation EfficientDet: Scalable and Efficient Object Detection in PyTorch

Solving Zero-Shot Learning in Named Entity Recognition with Common Sense Knowledge

Official implement of Evo-ViT: Slow-Fast Token Evolution for Dynamic Vision Transformer