Key Points
- Motion capture records real-world movement and transfers it to 3D characters in your engine.
- Optical (marker-based) and inertial (suit-based) are the two main hardware categories.
- Data cleanup and retargeting are the most time-consuming steps — plan for them.
- Pre-made mocap packs like those from MoCap Online skip the capture stage entirely.
- FBX and BVH are the standard export formats supported by Unreal Engine and Unity.
What Is Motion Capture?
Motion capture (MoCap) is the process of recording real-world movement and translating it into digital animation data. Sensors, cameras, or algorithms track the position and rotation of a performer's body over time, generating a stream of 3D skeletal data that can be applied to any digital character rig. The result is animation that feels natural, fluid, and physically believable — qualities that are incredibly difficult to achieve by hand-keying every pose.
Used in blockbuster films, AAA video games, medical research, and virtual production, MoCap has become the backbone of modern character animation. With the rise of affordable hardware and pre-made animation packs, even solo developers and indie studios can now harness professional-grade motion data.
This motion capture tutorial covers everything from hardware setup to clean data export for game engines.
Types of Motion Capture Technology
1. Optical Motion Capture
Optical systems use multiple high-speed cameras placed around a stage to track reflective markers attached to a performer's body. Software triangulates the 3D position of each marker to reconstruct full-body movement. Systems like Vicon and Optitrack are industry gold standards. The downsides: high cost, large dedicated space, and significant setup time.
2. Inertial Motion Capture
Inertial suits, like those from Xsens and Rokoko, use IMU (Inertial Measurement Unit) sensors embedded in a wearable suit. They measure acceleration, rotation, and orientation in real time, requiring no cameras or fixed space. Inertial capture is portable and increasingly affordable — ideal for freelancers and small studios.
3. Markerless Motion Capture
Markerless systems use computer vision and AI to estimate pose from standard video footage — no suit, no markers required. Tools like Move.ai, DeepMotion, and even phone-based solutions fall into this category. Accuracy is improving rapidly, but for high-fidelity game or film work, marker-based systems still lead.
Hardware Requirements for Getting Started
The hardware you need depends on which capture method you choose:
- Optical: Minimum 6–8 cameras (Optitrack or Vicon), IR-reflective markers, capture stage (12×12 ft minimum), and a powerful workstation.
- Inertial: A full-body sensor suit (17–19 IMUs), a receiving hub, and a mid-range PC. Rokoko Smartsuit Pro II runs ~$3,000 for a full kit.
- Markerless: A decent webcam or smartphone, GPU-accelerated PC or cloud subscription, and relevant software.
- Pre-made packs: Zero hardware — download FBX or BIP files and apply them in your 3D app immediately.
Software Options for MoCap Workflows
Autodesk MotionBuilder
The industry-standard tool for cleaning, retargeting, and editing motion capture data. MotionBuilder's real-time playback and non-linear story editor make it indispensable for high-volume MoCap pipelines. Best suited for studios already in the Autodesk ecosystem.
Reallusion iClone
iClone offers an accessible entry point to MoCap-driven animation. It supports live inertial capture via Perception Neuron and Rokoko, provides built-in retargeting, and integrates tightly with Character Creator rigs. Excellent for indie animators and content creators.
Blender
Blender is a free, open-source 3D suite with strong BVH and FBX import support. Its NLA (Non-Linear Animation) editor allows layering and blending motion clips. With add-ons like Rokoko Studio Link, it can receive live MoCap data. Ideal for beginners on a budget. Browse our learning resources for animators to continue building your skills.
Unreal Engine & Unity
Both game engines have mature animation systems (Unreal's Control Rig and Animation Blueprint; Unity's Animator Controller) that can ingest FBX-based MoCap data and drive real-time characters.
The MoCap Workflow: Capture to Game Engine
- Pre-production: Define your animation list (shot list), cast your performer, calibrate your capture system.
- Capture session: Record takes with the performer acting out each motion. Log take names.
- Data cleanup: Remove noise, fill gaps, correct foot skating, and solve any marker swap artifacts in MotionBuilder or your capture software.
- Retargeting: Map the captured skeleton to your character's rig. Proportional differences require careful constraint setup.
- Editing & layering: Trim clips, blend transitions, and add secondary motion (finger detail, facial animation).
- Export: Export FBX or BVH at the target frame rate (30fps or 60fps for games).
- Engine integration: Import into Unreal or Unity, set up Animation Blueprints/State Machines, and test in context.
Common Beginner Mistakes
- Skipping calibration: Always calibrate your capture system before every session. Even 15 minutes of drift kills accuracy.
- Ignoring foot contact: Foot skating is the most common artifact. Use foot locking constraints during cleanup.
- Poor retargeting setup: Mismatched T-poses or bind poses between source and target skeletons cause wild deformations.
- Forgetting frame rate consistency: Mixing 30fps capture with 60fps game targets creates playback issues. Decide upfront.
- No performance direction: Garbage in, garbage out. A great capture of a bad performance is still a bad animation.
When to Use Pre-Made Animation Packs vs. Live Capture
Live capture makes sense when you need custom, one-of-a-kind performances that fit a specific narrative beat, have the budget and space for a session, or need volume (hundreds of unique animations). Pre-made packs — like those available at MoCap Online — are the smart choice when you need production-ready animations immediately, want to control cost, or need standard locomotion sets (walk, run, jump, idle) that don't require a unique performance. Many studios use both: packs for common animations, live sessions for hero moments.
Learning Resources
- Autodesk MotionBuilder documentation and tutorial library
- Rokoko Studio free tier + YouTube tutorials
- Blender's official animation docs + CG Cookie courses
- Unreal Engine's Animation documentation on Epic's learning portal
- MoCap Online blog — in-depth guides on retargeting, formats, and workflows
Frequently Asked Questions
Q: Can I learn motion capture without expensive hardware?
A: Absolutely. Start with markerless tools like DeepMotion or Move.ai, which work with standard webcams. Alternatively, buy pre-made FBX animation packs and learn retargeting and engine integration before investing in capture hardware.
Q: What file format should I use for game engines?
A: FBX is the universal standard and is supported by Unreal Engine, Unity, Blender, MotionBuilder, and most other tools. BVH is common in academic and research contexts. For Unreal specifically, FBX with baked animation is the safest import path.
Q: How long does it take to clean up a MoCap session?
A: Industry rule of thumb is 3–5x the recorded time. A one-hour capture session typically yields 3–5 hours of cleanup work in MotionBuilder. Pre-made packs bypass this entirely — they're delivered clean and ready to use.
Q: Do I need to match skeleton proportions for retargeting?
A: Not exactly, but T-pose alignment between source and target skeletons must be consistent. Most software (MotionBuilder, iClone, Unreal) has retargeting tools that handle proportion differences, but the better the bind pose match, the cleaner the result.
Engine-Specific MoCap Integration Tutorials
Each game engine handles motion capture data differently. Here are quick-start workflows for the most popular engines used with MoCap Online animation packs:
Unreal Engine 5: Import FBX files directly into your Content Browser. Set the skeleton to your project character or create a new one from the imported data. Use IK Retargeter to map bones between the MoCap Online skeleton and your custom character. Set up an Animation Blueprint with blend spaces for locomotion and montages for action clips.
Unity: Import FBX files and set the Animation Type to Humanoid in the import settings. Unity Avatar system auto-maps bones for standard humanoid rigs. Create an Animator Controller with states for each animation clip and configure transitions based on gameplay parameters.
Blender: Use our native .blend files for the cleanest integration. The animations import with proper armature data that works with Blender NLA Editor for animation layering and mixing. For FBX import, use the official FBX importer with default bone orientation settings.
For complete step-by-step tutorials with screenshots, visit our getting started guide. Our free Unreal Blueprints Demo provides a complete working project you can study and extend.
Motion Capture for Specific Disciplines: What's Different
Motion capture workflows vary significantly by application domain. What works for game character locomotion differs from film visual effects, medical biomechanics, or real-time digital human applications. Understanding these differences helps you configure your pipeline correctly.
Games vs. Film: Real-Time vs. Offline Rendering
Game MoCap prioritizes clean, loopable clips at 30–60fps with defined state boundaries. Film MoCap prioritizes continuous performance capture at high sample rates (120fps) with the full nuance of the actor's performance preserved. Game cleanup aggressively filters subtle noise that would be invisible at gameplay camera distances; film cleanup preserves more of the raw data because the offline renderer can use full-quality skeletal animation without performance constraints.
When sourcing MoCap data for games, verify the delivery format is game-ready: 30fps or 60fps, clean loops, and state boundaries clearly marked in the clip names. Raw film-format MoCap data requires additional processing for game use — it's not interchangeable without cleanup and resampling.
Biomechanical Research Applications
Medical and biomechanical MoCap has different requirements than entertainment. Accuracy is paramount — joint angles must be biomechanically correct to within a small margin. Optical systems with 8+ cameras and retroreflective markers remain the standard for research applications because their accuracy is documented and replicable. Inertial systems are acceptable for entertainment but below clinical accuracy thresholds.
For ergonomic analysis in industrial digital twins, the accuracy requirements are between research and entertainment. Joint angles must be correct enough to identify ergonomic risk factors, but don't require clinical-grade precision. Professional inertial systems (Xsens, Rokoko) are appropriate for this use case; consumer-grade solutions are not.
Real-Time Capture for Live Performance
Live performance applications — virtual production, VTubing, live broadcast — require MoCap systems that operate with sub-frame latency and need no post-processing. Optical tracking has inherent latency from camera processing; inertial systems can achieve lower latency with direct suit-to-output pipelines. For live performance, system latency below 30ms is the threshold for performer comfort — above that, the performer begins to see their avatar lag behind their own movement.
Real-time MoCap for live broadcast also requires graceful degradation: the system must continue delivering usable output when individual markers are temporarily occluded or sensors are briefly disturbed, rather than dropping frames or producing position spikes that would be visible to a live audience.
Motion Capture Learning Resources by Skill Level
The learning path for motion capture animation integration depends on whether the goal is consuming pre-captured data (importing and integrating existing libraries) or producing new capture data (running capture sessions and processing raw data). For integration-focused developers — the majority of game developers — the essential skills are engine-specific: Unreal Engine's Animation Blueprint and IK Retargeter, or Unity's Animator Controller and Avatar system. The best resources for these are Epic Games' official UE5 animation documentation and Unity's official Animator documentation combined with YouTube tutorials for practical workflow walkthroughs.
For production-focused developers who need to operate capture hardware and process raw data, the Rokoko and Xsens product documentation covers their respective hardware workflows. The CMU Graphics Lab tutorial archive covers the theoretical foundations of solving and retargeting that apply regardless of hardware platform. For developers integrating professional motion capture packs into their projects, MoCap Online's blog and product documentation provides engine-specific integration guides that bridge the gap between theory and implementation.
Motion capture animation extends beyond full-body performance — advanced motion capture systems can simultaneously record facial expressions using dedicated face-tracking cameras or marker rigs, enabling characters that communicate emotion through both movement and face in a single capture pass.
Summary
Learning motion capture does not require access to a professional volume. Understanding the pipeline — capture, cleanup, retargeting, and integration — is what matters most. Whether you are using raw mocap hardware or pre-built animation packs, the same principles apply. Focus on clean skeletons, proper retargeting, and building a library of reusable clips that serve your game's needs.

