Study unveils AI-driven, real-time, hand-object pose estimation framework

Last updated: March 27, 2025 2:02 pm

By Celebrity Gig Add a Comment

Examples of estimated 3D poses on H2O dataset: For a separate example in each row, the figure represents (a) input RGB image, (b) our hand-object queries, (c) ground-truth contact map, (d) predicted contact map, and (e) final 3D pose estimation results, respectively. Credit: Ulsan National Institute of Science and Technology

A new AI-powered framework has been developed, offering new capabilities for the real-time analysis of two hands engaged in manipulating an object.

A research team led by Professor Seungryul Baek from the UNIST Artificial Intelligence Graduate School has introduced the Query-Optimized Real-Time Transformer (QORT-Former) framework, which accurately estimates the 3D poses of two hands and an object in real time.

The work was published on the arXiv preprint server and was presented at the Annual AAAI Conference on Artificial Intelligence (AAAI), Pennsylvania, USA.

Unlike previous methods that require substantial computational resources, QORT-Former achieves exceptional efficiency while maintaining state-of-the-art accuracy.

To optimize performance, the team proposed a novel query division strategy that enhances query features by leveraging contact information between the hands and the object, in conjunction with a three-step feature update within the transformer decoder. With only 108 queries and a single decoder, QORT-Former achieves 53.5 frames per second (FPS) on an RTX 3090 Ti GPU, making it the fastest known model for hand-object pose estimation.

Professor Seungryul Baek stated, “QORT-Former represents a significant advancement in the understanding of hand-object interactions. It not only enables real-time applications in augmented reality (AR), virtual reality (VR), and robotics, but also pushes the boundaries of real-time AI models.”

“Our work demonstrates that efficiency and accuracy can be optimized simultaneously,” co-first author Khalequzzaman Sayem remarked. “We anticipate broader adoption of our method in fields that require real-time hand-object interaction analysis.”

More information:
Elkhan Ismayilzada et al, QORT-Former: Query-optimized Real-time Transformer for Understanding Two Hands Manipulating Objects, arXiv (2025). DOI: 10.48550/arxiv.2502.19769

Journal information:
arXiv

Provided by
Ulsan National Institute of Science and Technology

Citation:
Study unveils AI-driven, real-time, hand-object pose estimation framework (2025, March 27)
retrieved 27 March 2025
from

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no
part may be reproduced without the written permission. The content is provided for information purposes only.

Top Stories

Nasdaq 100’s worst quarter in years sealed by AI bubble fears

₦50 Million and Counting: ‘Labake Olododo’ storms cinemas and tops the box office

5 Must-Watch Netflix series to binge this week

Stay Connected

Study unveils AI-driven, real-time, hand-object pose estimation framework

Leave a Reply Cancel reply

Content Safety

Trustworthy

Celebrity Gig Magazine

Related Stories

Agents project 100% clearing cost hike over subsidy removal

Alleged Silk Road hitman hit with narcotics, money laundering charges

Sam Bankman-Fried backtracks from $1 billion political donation quote

Stripe lays off 14% of workers

Bitcoin’s volatility falls below Nasdaq and S&P 500’s for first time since 2020

Europe gives Musk 24 hours to respond about Israel-Hamas war misinformation on X

How TikTok influencers are helping companies recruit new workers

Govt, UN to raise $5bn annually for poverty alleviation

About Us

Quick Links