Today I’m going to share a little trick that will save you a ton of time when working with Kinect: how to properly measure the distance between the sensor and a body or object.
First, some good-to-know background.
How Kinect measures distances
As you know, Kinect integrates an infrared sensor, along with a depth processor. The visible area is called “field of view”. The depth processor produces depth frames. Each depth frame is a grid of 512×424 points. The Kinect SDK is then feeding the depth frames to a powerful body-detection algorithm. The algorithm identifies 25 human body joints and calculates their coordinates in the 3D space.
Every single joint has 3 values: X, Y, and Z. It is projected in a Cartesian coordinate system. The (0, 0, 0) point is the position of the sensor. Every other point is measured in terms of the position of the sensor! Check the overhead graphic below. It’s viewing the sensor and the scene from above.
- X is the position in the horizontal axis.
- Y is the position in the vertical axis.
- Z is the position in the depth axis.
Making sense of the Z value
X and Y are fairly easy to understand. That’s not true for Z. If you take a close look at the graphic above, you’ll notice that the Z value is not the linear distance between the point and the sensor. Instead, it’s the distance between the point and the plane of the sensor! It’s like having a virtual wall right in front of the Kinect. The Z value is the vertical distance from that wall (drawn in green).
To access the Z value, use the code below:
var distance = body.Joints[JointType.SpineBase].Position.Z;
Simple, huh? The “body” variable represents the tracked player. The “Position” property is a set of X, Y, and Z coordinates in the 3D space. The “Z” value is the distance between the player and the plane. I’ve used the SpineBase joint as a reference, since it’s the most reliably-tracked joint, and it’s also closer to the center of mass. You can read more about processing body data in my body-tracking tutorial.
Measuring the distance
How about the physical distance, though? The distance between the player and the device is represented by a mathematical vector (drawn in blue). Thankfully, we do not need to be gurus in Algebra to measure the length of a vector in the 3D space.
The length of a vector is given by the formula below:
Vitruvius implements this formula in its Length extension method:
var distance = point.Length();
This is it! You now know exactly how far or how close someone is to the Kinect sensor! Of course, if someone is standing right in front of the sensor, the Z value and the distance would converge.
Hint: in case you need to measure distances of points that do not belong to the human body, you can use Vitruvius Coordinate Mapping.
Summary
Kinect provides us with the X, Y, and Z coordinates of the human body joints.
- To measure the distance between the player and the plane of the sensor, we just use the Z value as-is.
- To measure the distance between the player and the sensor, we calculate the length of the corresponding vector.
Ready? Download Vitruvius now!
Get Vitruvius