MediaPipeで姿勢推定を行い、Tokyo2020オリンピック風のピクトグラムを表示するデモ

Last update: Dec 26, 2022

Overview

Tokyo2020-Pictogram-using-MediaPipe

MediaPipeで姿勢推定を行い、Tokyo2020オリンピック風のピクトグラムを表示するデモです。

Tokyo2020Pictgram02.mp4

Requirement

mediapipe 0.8.6 or later
OpenCV 3.4.2 or later

Demo

以下コマンドでデモを起動してください。
ESCキー押下でプログラム終了します。

python main.py

--device
カメラデバイス番号の指定
デフォルト：0
--width
カメラキャプチャ時の横幅
デフォルト：640
--height
カメラキャプチャ時の縦幅
デフォルト：360
--static_image_mode
静止画モード
デフォルト：指定なし
--model_complexity
モデルの複雑度(0:Lite 1:Full 2:Heavy)
※性能差はPose Estimation Qualityを参照ください
デフォルト：1
--min_detection_confidence
検出信頼値の閾値
デフォルト：0.5
--min_tracking_confidence
トラッキング信頼値の閾値
デフォルト：0.5
--rev_color
背景色とピクトグラムの色を反転する
デフォルト：指定なし

Using Docker

Ubuntuの場合はホストマシンにMediaPipeをインストールせず、Docker + docker-composeを使うこともできます。

まず環境に合わせてdocker-compose.ymlを編集します。
ビデオデバイスを指定する際video0を使う場合は以下のように編集します。

    # Edit here
    devices:
      # - "/dev/video0:/dev/video0"
      # - "/dev/video1:/dev/video0"
-     - "/dev/video2:/dev/video0"
+     - "/dev/video0:/dev/video0"

次にDockerイメージをビルドします。

docker-compose build

最後にDockerコンテナを起動します。

docker-compose up

Author

高橋かずひと(https://twitter.com/KzhtTkhs)

License

Tokyo2020-Pictogram-using-MediaPipe is under Apache-2.0 License.

MediaPipeで姿勢推定を行い、Tokyo2020オリンピック風のピクトグラムを表示するデモ

Related tags

Overview

Tokyo2020-Pictogram-using-MediaPipe

Requirement

Demo

Using Docker

Author

License

Owner

KazuhitoTakahashi

A voice recognition assistant similar to amazon alexa, siri and google assistant.

Lightweight Face Image Quality Assessment

PyTorch code for: Learning to Generate Grounded Visual Captions without Localization Supervision

Large Scale Multi-Illuminant (LSMI) Dataset for Developing White Balance Algorithm under Mixed Illumination

torchbearer: A model fitting library for PyTorch

Augmented CLIP - Training simple models to predict CLIP image embeddings from text embeddings, and vice versa.

FlexConv: Continuous Kernel Convolutions with Differentiable Kernel Sizes

Implementations of CNNs, RNNs, GANs, etc

Simple embedding based text classifier inspired by fastText, implemented in tensorflow

The most simple and minimalistic navigation dashboard.

Unet network with mean teacher for altrasound image segmentation

Official pytorch implementation of paper Dual-Level Collaborative Transformer for Image Captioning (AAAI 2021).

Seeing All the Angles: Learning Multiview Manipulation Policies for Contact-Rich Tasks from Demonstrations

A Tensorflow implementation of BicycleGAN.

Face Mask Detection on Image and Video using tensorflow and keras

Official Repo for ICCV2021 Paper: Learning to Regress Bodies from Images using Differentiable Semantic Rendering

This repository provides the code for MedViLL(Medical Vision Language Learner).

Deep Learning: Architectures & Methods Project: Deep Learning for Audio Super-Resolution

Customer-Transaction-Analysis - This analysis is based on a synthesised transaction dataset containing 3 months worth of transactions for 100 hypothetical customers.

Official implementation of Unfolded Deep Kernel Estimation for Blind Image Super-resolution.