RapidPoseTriangulation

Fast multi-view multi-person pose reconstruction, packaged as a Python-first C++ library.

This repository started from RapidPoseTriangulation and has since been refactored into a slimmer library-focused fork. The current repo keeps the triangulation core, exposes it through nanobind, and adds an RGB-D reconstruction path inspired by SimpleDepthPose.

[Demo images: 2D detections → 3D detections, and 3D-to-2D projection]

What This Repository Is Now

  • A packaged library named rapid-pose-triangulation with Python bindings under rpt
  • A C++ core built with scikit-build-core and nanobind
  • A triangulation library for calibrated multi-view 2D detections
  • An RGB-D reconstruction helper layer that samples aligned depth, applies joint offsets, lifts poses into world coordinates, and merges per-view proposals

Current package status:

  • Python >=3.10
  • NumPy runtime dependency
  • Current version: 0.2.0

Current Capabilities

The public Python API exposed by rpt currently includes:

  • Camera/config helpers: make_camera, convert_cameras, make_triangulation_config
  • Input preparation: pack_poses_2d
  • Triangulation: triangulate_poses, triangulate_debug, triangulate_with_report
  • Tracking/debug helpers: filter_pairs_with_previous_poses
  • RGB-D helpers: sample_depth_for_poses, apply_depth_offsets, lift_depth_poses_to_world, merge_rgbd_views, reconstruct_rgbd

At a high level there are now two supported reconstruction paths:

  1. Multi-view RGB triangulation from calibrated 2D detections
  2. Multi-view RGB-D reconstruction from calibrated 2D detections plus aligned depth images
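The core idea behind path 1 is classic direct linear transform (DLT) triangulation: each calibrated view contributes two linear constraints on the homogeneous 3D point. The sketch below is a minimal numpy illustration of that math, not the library's actual (and much faster) C++ implementation:

```python
import numpy as np

def triangulate_point(proj_mats, points_2d):
    """Linear (DLT) triangulation of one 3D point from >= 2 calibrated views.

    proj_mats: list of 3x4 projection matrices P = K [R | t]
    points_2d: list of (u, v) pixel observations, one per view
    """
    rows = []
    for P, (u, v) in zip(proj_mats, points_2d):
        # Each view contributes two linear constraints on the homogeneous X.
        rows.append(u * P[2] - P[0])
        rows.append(v * P[2] - P[1])
    A = np.stack(rows)
    # The solution is the right-singular vector for the smallest singular value.
    _, _, vt = np.linalg.svd(A)
    X = vt[-1]
    return X[:3] / X[3]  # dehomogenize

def project(P, X):
    x = P @ np.append(X, 1.0)
    return x[:2] / x[2]

# Two synthetic cameras observing a known point.
K = np.array([[500.0, 0, 320], [0, 500.0, 240], [0, 0, 1]])
P1 = K @ np.hstack([np.eye(3), np.zeros((3, 1))])              # camera at origin
P2 = K @ np.hstack([np.eye(3), np.array([[-1.0], [0], [0]])])  # 1 m baseline on x
X_true = np.array([0.2, -0.1, 3.0])

obs = [project(P1, X_true), project(P2, X_true)]
X_hat = triangulate_point([P1, P2], obs)
print(np.allclose(X_hat, X_true, atol=1e-6))  # True
```

With noisy detections the SVD solution minimizes algebraic rather than reprojection error, which is why production pipelines add scoring and view-pair selection on top.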

Installation And Development Workflow

Clone the repo and use uv for local development:

git clone https://git.weihua-iot.cn/crosstyan/RapidPoseTriangulation.git
cd RapidPoseTriangulation
uv sync --group dev

Run the test suite:

uv run pytest -q

Build source and wheel artifacts:

uv build

run_container.sh is still present in the repo, but it is a leftover helper script rather than part of the supported development workflow.

Python API Overview

Typical triangulation flow:

import numpy as np
import rpt

cameras = rpt.convert_cameras(raw_cameras)
config = rpt.make_triangulation_config(
    cameras,
    roomparams=np.asarray([[5.6, 6.4, 2.4], [0.0, -0.5, 1.2]], dtype=np.float32),
    joint_names=joint_names,
)

poses_2d, person_counts = rpt.pack_poses_2d(views, joint_count=len(joint_names))
poses_3d = rpt.triangulate_poses(poses_2d, person_counts, config)

Typical RGB-D flow:

poses_2d, person_counts = rpt.pack_poses_2d(views, joint_count=len(joint_names))
poses_3d = rpt.reconstruct_rgbd(
    poses_2d,
    person_counts,
    depth_images,
    config,
    use_depth_offsets=True,
)
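Conceptually, use_depth_offsets=True shifts each sampled body-surface depth toward the joint centre by a per-joint constant. A minimal numpy sketch of that step follows; the offset values and the [u, v, d, score] array layout are illustrative assumptions, not the library's actual constants:

```python
import numpy as np

# One person, two joints, each as [u, v, d, score]; d is the camera-measured
# surface depth in metres. The offsets below are made-up illustrative values,
# not the SimpleDepthPose body-surface constants.
poses_uvd = np.array([
    [[320.0, 200.0, 2.50, 0.9],   # joint 0, e.g. a shoulder
     [330.0, 340.0, 2.45, 0.8]],  # joint 1, e.g. a hip
], dtype=np.float32)

joint_depth_offsets = np.array([0.06, 0.09], dtype=np.float32)  # metres

def apply_offsets(poses, offsets):
    out = poses.copy()
    # Only shift joints that were actually observed (score > 0).
    visible = out[..., 3] > 0
    out[..., 2] += np.where(visible, offsets[np.newaxis, :], 0.0)
    return out

shifted = apply_offsets(poses_uvd, joint_depth_offsets)
print(shifted[0, :, 2])  # depths pushed from the surface toward joint centres
```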

The lower-level RGB-D helpers are also available if you want to inspect or customize the intermediate steps:

  • sample_depth_for_poses: sample aligned depth around visible 2D joints
  • apply_depth_offsets: add per-joint offsets derived from SimpleDepthPose
  • lift_depth_poses_to_world: convert [u, v, d, score] joints into world-space [x, y, z, score]
  • merge_rgbd_views: merge per-view world-space pose proposals into final poses
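To make the lifting step concrete, here is a small numpy sketch of the pinhole back-projection that a lift_depth_poses_to_world-style helper has to perform. The camera conventions (world-to-camera extrinsics X_cam = R X_world + t, d as metric depth along the camera z axis) are assumptions; the library's exact layout may differ:

```python
import numpy as np

def lift_uvd_to_world(joints_uvd, K, R, t):
    """Back-project [u, v, d, score] joints into world-space [x, y, z, score]."""
    u, v, d, s = joints_uvd.T
    fx, fy = K[0, 0], K[1, 1]
    cx, cy = K[0, 2], K[1, 2]
    # Pixel + depth -> camera-space point.
    x_cam = np.stack([(u - cx) * d / fx, (v - cy) * d / fy, d])
    # Camera -> world: X_world = R^T (X_cam - t).
    x_world = R.T @ (x_cam - t[:, None])
    return np.column_stack([x_world.T, s])

K = np.array([[500.0, 0, 320], [0, 500.0, 240], [0, 0, 1]])
R, t = np.eye(3), np.zeros(3)            # camera at the world origin
joints = np.array([[420.0, 140.0, 2.0, 0.9]])  # one joint: u, v, d, score
print(lift_uvd_to_world(joints, K, R, t))
# x = (420 - 320) * 2 / 500 = 0.4, y = (140 - 240) * 2 / 500 = -0.4, z = 2.0
```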

Ported From SimpleDepthPose

This fork ports the RGB-D fusion path from the sibling SimpleDepthPose project into rpt.

The ported pieces are:

  • Depth sampling around each visible 2D joint, based on the add_depth preprocessing flow
  • Per-joint depth offsets, matching the SimpleDepthPose body-surface correction idea
  • UVD-to-world lifting using the calibrated camera intrinsics/extrinsics
  • Multi-view RGB-D pose fusion logic adapted from PoseFuser
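As an illustration of the depth-sampling idea, a robust patch sample around a 2D joint might look like the sketch below. The patch size and the choice of a median over valid pixels are assumptions for illustration, not the exact SimpleDepthPose parameters:

```python
import numpy as np

def sample_depth(depth, u, v, half=2):
    """Robustly sample depth around pixel (u, v).

    Takes the median over a (2*half+1)^2 patch, ignoring zero (invalid)
    readings -- a common way to survive sensor holes at joint locations.
    """
    h, w = depth.shape
    r0, r1 = max(0, v - half), min(h, v + half + 1)
    c0, c1 = max(0, u - half), min(w, u + half + 1)
    patch = depth[r0:r1, c0:c1]
    valid = patch[patch > 0]
    return float(np.median(valid)) if valid.size else 0.0

depth = np.full((480, 640), 2.5, dtype=np.float32)
depth[239:242, 319:322] = 0.0          # a small sensor hole right at the joint
print(sample_depth(depth, 320, 240))   # 2.5, recovered from surrounding pixels
```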

Compared with the original SimpleDepthPose implementation, the port has been adapted to fit a reusable library:

  • The workflow is exposed as stateless functions instead of script-driven pipelines
  • The fusion logic lives in the rpt core instead of a separate wrapper class
  • Camera and scene configuration are routed through TriangulationConfig
  • The RGB-D path is covered by repo tests and packaged with the same Python API as the triangulation path

This repo does not attempt to port the full SimpleDepthPose project. It only ports the RGB-D reconstruction pieces that fit the current library scope.

Changed Vs Upstream RapidPoseTriangulation

Compared with the upstream repository at gitlab.com/Percipiote/RapidPoseTriangulation, this fork has materially changed structure and scope:

  • SWIG bindings were replaced with nanobind
  • The repo was converted into a Python package under src/rpt
  • The triangulation interface was simplified around immutable cameras and config structs
  • The core was reshaped into a more library-oriented, zero-copy style API
  • Debug tracing and tracked association reports were added
  • Upstream integration layers and extra tooling were removed, including the old extras/ stack and related deployment/inference wrappers
  • An RGB-D reconstruction pipeline was added by porting and adapting parts of SimpleDepthPose

In practice, upstream is closer to a larger project tree with integrations and historical tooling, while this fork is closer to a compact reconstruction library.

Testing

The repo currently ships Python-facing tests for both triangulation and RGB-D reconstruction:

uv run pytest tests/test_interface.py
uv run pytest tests/test_rgbd.py

Or run everything:

uv run pytest -q

The checked-in sample data under data/ is used by the triangulation tests.

Citations

Please cite RapidPoseTriangulation if you use the triangulation core:

@article{rapidtriang,
  title={{RapidPoseTriangulation: Multi-view Multi-person Whole-body Human Pose Triangulation in a Millisecond}},
  author={Bermuth, Daniel and Poeppel, Alexander and Reif, Wolfgang},
  journal={arXiv preprint arXiv:2503.21692},
  year={2025}
}

The RGB-D path in this fork is based on ideas ported from SimpleDepthPose:

@article{simdepose,
  title={{SimpleDepthPose: Fast and Reliable Human Pose Estimation with RGBD-Images}},
  author={Bermuth, Daniel and Poeppel, Alexander and Reif, Wolfgang},
  journal={arXiv preprint arXiv:2501.18478},
  year={2025}
}