IntBot bets the future of humanoids on social intelligence, not kung fu

IntBot bets the future of humanoids on social intelligence, not kung fu

Entrance Desk Data Concierge assisted GTC 2026 guests with navigation and occasion data. | Credit score: The Robotic Report

In simply over a 12 months, IntBot Inc. has gone from idea to full‑physique humanoids greeting 1000’s of friends at NVIDIA’s GTC and in lodge lobbies. The Sunnyvale, Calif.-based startup has used 24/7 interplay footage and sentiment evaluation to coach a social intelligence engine that sits on prime of off‑the‑shelf {hardware}.

At GTC 2026, CEO Lei Yang introduced that the company‘s IntEng “common social intelligence engine” now helps a number of humanoid and repair robots from totally different {hardware} distributors. He mentioned this marked a big step towards hardware-agnostic deployment of socially clever robots in real-world environments.

Full length image of the Intbot robot showing its feet and stand.

This full-length view of the IntBot humanoid exhibits its toes, though the robotic was stationary and secured to face for its shift on the GTC26 assist desk. | Credit score: The Robotic Report

IntBot additionally showcased the primary edge deployment of the NVIDIA Cosmos Reason-2 vision-language mannequin (VLM) inside its software program stack. Working immediately on robotic edge compute techniques, the mannequin allows robots to carry out real-time scene understanding, permitting them to interpret advanced human environments resembling crowded convention areas.

“The primary-generation robotic was a pre-programmed sort of motion. However for our robotic, in the event you have been at CES or watched the movies, all of the feelings are generated,” acknowledged Yang. “So even if you’re not speaking to the robotic, the robotic would reply with some very pure, very refined movement, simply the very facet nodding, to indicate ‘Okay, I’m listening,’ and even the sort of movement to point ‘I’m alive.’ Every thing is pushed by our social intelligence.”

Working immediately on robotic edge compute techniques, the mannequin allows robots to carry out real-time scene understanding, permitting them to interpret advanced human environments resembling crowded convention areas.

In keeping with Yang, ItBot’s robots use a type of audio-visual fusion, combining what it hears with what it sees, to raised perceive who within the scene is speaking and what the audio system’ intent is likely to be. This permits the robotic to offer a extra pure interplay with people.

IntBot stays platform-neutral

Whereas most humanoid startups chase ever-better locomotion and manipulation, IntBot is intentionally staying “{hardware} agnostic,” positioning its software program as a social intelligence layer that may journey on prime of no matter platforms the trade produces subsequent.

At the moment, that stack powers Nilo, a full-body humanoid that works 24/7 as a multilingual concierge in lodge lobbies from New York to Las Vegas, mixing on-device notion and body-language era with cloud LLMs for deeper queries.

“Proper now, we have already got three resorts throughout the U.S.,” mentioned Yang. “[We’re at] The Nap York in New York City, and a second one is known as Otonomous in Las Vegas, and the third one is a Marriott Lodge in Tulsa, Okla. And all of those three robots function 24/7, principally. They work alongside their human workers members, however IntBot gives add-on features to reinforce what the human workers, concierge, or [other] individuals can do.”

By focusing first on noisy, real-world environments like CES and busy lodge lobbies, the place earlier kiosk-style techniques and robots like Pepper stumbled, Lei Yang is betting that mastering pure, multi-party interplay would be the key to getting humanoids accepted as on a regular basis co-workers, not simply trade-show spectacles.



IntEngine coordinates notion, communication in actual time

Nylo’s skill to function autonomously on the GTC present flooring is powered by IntEngine, IntBot’s proprietary, multimodal, multi-loop social intelligence system. IntEngine fuses imaginative and prescient, audio, and language in actual time to coordinate speech, facial features, and gesture—enabling robots to understand social context and reply naturally.

This structure permits Nylo not simply to reply, however to determine when and the right way to interact, which Yang mentioned is an important functionality for working in open, public environments.

The publish IntBot bets the way forward for humanoids on social intelligence, not kung fu appeared first on The Robotic Report.