Notes for Paper “Towards Accurate Multi-person Pose Estimation in the Wild “


Papandreou, George, et al. “Towards accurate multiperson pose estimation in the wild.” arXiv preprint arXiv:1701.01779 8 (2017).


ResNet for keypoint, heatmap and offset

  • Performance
  • Basics
    • Without ground truth of the location or the scale of the person.
    • Top-down approach
  • Main methods
    • Pipeline:
      • Person box detection using Faster-RCNN, (ResNet-101)
        • CNN backbone pre-trained on ImageNet
          • No multi-scale evaluation.
      • Person pose estimation. Use ResNet 101 for heatmap and offset.
        • K=17 keypoints.
        • Classification && Regression
          • Firstly classify whether it is (0 or 1) in the neighborhood of any keypoint. (heatmap)
          • Predict a 2D local offset vector to see the precise keypoint location.

  • Take home message
  • Other methods mentioned
  • Evaluation
    • Object Keypoint Similarity
    • COCO

Leave a Reply

Fill in your details below or click an icon to log in: Logo

You are commenting using your account. Log Out /  Change )

Facebook photo

You are commenting using your Facebook account. Log Out /  Change )

Connecting to %s