Assembly Magazine logo
search
Ask ASSEMBLY AI
cart
facebook twitter linkedin youtube
  • Sign In
  • Create Account
  • Sign Out
  • My Account
Assembly Magazine logo
  • TRENDS
    • Ask ASSEMBLY AI
    • Trends
    • News
    • New Products
  • INDUSTRIES
    • Aerospace
    • Appliance
    • DFMA Assembly
    • Medical Devices
    • Green Manufacturing
    • Lean Manufacturing
    • Machinery Assembly
    • Electronics Assembly
    • Automotive
  • TECHNOLOGIES
    • Adhesives & Dispensing
    • Assembly Presses
    • Automated Assembly Systems
    • Manufacturing Management
    • Manufacturing Software
    • Motion Control
    • Screwdriving & Riveting
    • Robotics
    • Test & Inspection
    • Plastics & Metal Welding
    • Wire Processing
    • Workstations
  • AUTONOMOUS & ELECTRIC MOBILITY
    • AEM Magazine Archives
    • Autonomy
    • Electrification
    • Mobility Services
    • Assembly & Testing
    • AV/EM News
  • MEDIA
    • Ask ASSEMBLY AI
    • Podcasts
    • Assembly News Now
    • Assembly TV
    • Webinars
    • eBooks
  • EVENTS
    • Calendar
    • The ASSEMBLY Show
  • MORE
    • Exclusives >
      • Plant of the Year
      • Capital Spending
    • Buyers Guide >
      • Supplier Insights
    • Classifieds
    • Featured Products
    • Newsletters
    • Store
    • White Papers
    • Columns
    • Sponsor Insights
  • INFOCENTER
    • Assembly & Test Solutions
  • EMAGAZINE
    • eMagazine
    • Archive Issues
    • Advertise
    • Contact Us
    • Sign Up
Assembly Breaking News Manufacturing SoftwareRobotics Assembly

Robotics

Multiple AI Models Help Robots Execute Complex Plans

A multimodal system uses models trained on language, vision and action data to help robots develop and execute plans for household, construction and manufacturing tasks.

MIT AI software for robots
January 9, 2024

CAMBRIDGE, MA—Your daily to-do list is likely pretty straightforward: wash the dishes, buy groceries, and other minutiae. It’s unlikely you wrote out “pick up the first dirty dish,” or “wash that plate with a sponge,” because each of these miniature steps within the chore feels intuitive. While we can routinely complete each step without much thought, a robot requires a complex plan that involves more detailed outlines.

MIT’s Improbable AI Lab, a group within the Computer Science and Artificial Intelligence Laboratory (CSAIL), has offered these machines a helping hand with a new multimodal framework: Compositional Foundation Models for Hierarchical Planning (HiP), which develops detailed, feasible plans with the expertise of three different foundation models. Like OpenAI’s GPT-4, the foundation model that ChatGPT and Bing Chat were built upon, these foundation models are trained on massive quantities of data for applications like generating images, translating text, and robotics.

Unlike RT2 and other multimodal models that are trained on paired vision, language, and action data, HiP uses three different foundation models each trained on different data modalities. Each foundation model captures a different part of the decision-making process and then works together when it’s time to make decisions. HiP removes the need for access to paired vision, language, and action data, which is difficult to obtain. HiP also makes the reasoning process more transparent.

What’s considered a daily chore for a human can be a robot’s “long-horizon goal”—an overarching objective that involves completing many smaller steps first—requiring sufficient data to plan, understand, and execute objectives. While computer vision researchers have attempted to build monolithic foundation models for this problem, pairing language, visual and action data is expensive. Instead, HiP represents a different, multimodal recipe: a trio that cheaply incorporates linguistic, physical, and environmental intelligence into a robot.

“Foundation models do not have to be monolithic,” says NVIDIA AI researcher Jim Fan, who was not involved in the paper. “This work decomposes the complex task of embodied agent planning into three constituent models: a language reasoner, a visual world model, and an action planner. It makes a difficult decision-making problem more tractable and transparent.”

The team believes that their system could help these machines accomplish household chores, such as putting away a book or placing a bowl in the dishwasher. Additionally, HiP could assist with multistep construction and manufacturing tasks, like stacking and placing different materials in specific sequences.

The CSAIL team tested HiP’s acuity on three manipulation tasks, outperforming comparable frameworks. The system reasoned by developing intelligent plans that adapt to new information.

Looking for quick answers on assembly and manufacturing topics? Try Ask ASM, our new smart AI search tool. Ask ASM →

First, the researchers requested that it stack different-colored blocks on each other and then place others nearby. The catch: Some of the correct colors weren’t present, so the robot had to place white blocks in a color bowl to paint them. HiP often adjusted to these changes accurately, especially compared to state-of-the-art task planning systems like Transformer BC and Action Diffuser, by adjusting its plans to stack and place each square as needed.

Another test: arranging objects such as candy and a hammer in a brown box while ignoring other items. Some of the objects it needed to move were dirty, so HiP adjusted its plans to place them in a cleaning box, and then into the brown container. In a third demonstration, the bot was able to ignore unnecessary objects to complete kitchen sub-goals such as opening a microwave, clearing a kettle out of the way, and turning on a light. Some of the prompted steps had already been completed, so the robot adapted by skipping those directions.

For more information on the study, click here.

KEYWORDS: Artificial Intelligence (AI)

Share This Story

Looking for a reprint of this article?
From high-res PDFs to custom plaques, order your copy today!

Recommended Content

JOIN TODAY
To unlock your recommendations.

Already have an account? Sign In

  • Made in the U.S.A.

    Consumer Products Manufacturing: Made in the USA

    Supply chain lessons learned during the coronavirus...
    Automated Assembly Systems
    By: Austin Weber
  • Best Practices for Press-Fit Assembly

    Best Practices for Press-Fit Assembly

    In manufacturing, ironclad formulas for success are hard...
    Assembly Presses
    By: Jim Camillo
  • aem0523leader-tesla1.jpg

    Tesla Rethinks the Assembly Line

    Engineers at Tesla Inc. have developed a new process that...
    Automotive Assembly
    By: Austin Weber
Manage My Account
  • eMagazine Subscription
  • Assembly Newsletters
  • Online Registration
  • Subscription Customer Service
  • Manage My Preferences

More Videos

Sponsored Content

Sponsored Content is a special paid section where industry companies provide high quality, objective, non-commercial content around topics of interest to the ASSEMBLY audience. All Sponsored Content is supplied by the advertising company and any opinions expressed in this article are those of the author and not necessarily reflect the views of ASSEMBLY or its parent company, BNP Media. Interested in participating in our Sponsored Content section? Contact your local rep!

close
  • ultrasonic welding
    Sponsored bySonobond Ultrasonics

    Engineering Efficiency in High-Performance Assembly: How Ultrasonic Welding Enhances Throughput, Reliability and Quality

  • UV curing system
    Sponsored byDymax

    Why UV Intensity Alone Doesn’t Define Curing Performance

  • wooden pallets
    Sponsored byLEAN Manufacturing Products

    Eliminating Waste on the Shop Floor: Applying Lean Principles to Improve Manufacturing Efficiency

Popular Stories

Ferrari

Ferrari Unveils Four-Door EV

ASSEMBLY News Now, episode-30: Volvo Redesigns EV Manufacturing

Volvo Redesigns EV Manufacturing

automated consumer goods assembly system

Best Practices for Cycle Time Optimization

Watch the latest episode of ANN now!

Events

July 24, 2025

From Shop Floor to CFO: How Manufacturers Are Closing the Loop Between Operations and Finance

On Demand Learn how manufacturers are bridging the gap between the shop floor and ERP systems to gain real-time visibility, streamline operations, and kick-start digital transformation—without waiting years.

Sponsored by:

PicoStratusGreen
July 30, 2025

Buffer Analysis and Design Fundamentals for Manufacturing Excellence

On Demand In this presentation, Dr. Herman Tang shares practical insights from his industry experience and research on buffer management in manufacturing operations.

View All Submit An Event

Poll

Difficult Assembly Processes

Which assembly process gives you the most difficulty?
View Results Poll Archive

Products

Manufacturing Cost Policy Deployment (MCPD) Profitability Scenarios: Systematic and Systemic Improvement of Manufacturing Costs

Manufacturing Cost Policy Deployment (MCPD) Profitability Scenarios: Systematic and Systemic Improvement of Manufacturing Costs

See More Products
Register for webinar - Modernizing Automotive Assembly: Why Upgrading Legacy MES is a Business Imperative

Related Articles

  • Researchers Help Robots Solve Complex Assembly Problems

    Researchers Help Robots Solve Complex Assembly Problems

    See More
  • X-Y-Z: Vision Systems Help Robots Find, Assemble Parts

    See More
  • robot performing a dance routine

    Dance Moves Help Humanoid Robots Work Better With Humans

    See More

Related Products

See More Products
  • testing.jpg

    Testing Complex and Embedded Systems

  • project management.jpg

    Project Management of Complex and Embedded Systems

  • Robotic Micro-Assembly

See More Products

Related Directories

  • Ergo-Help Inc.

×

Never miss the latest news and trends driving the manufacturing industry

Stay in the know on the latest assembly trends.

JOIN TODAY!
  • RESOURCES
    • Advertise
    • Contact Us
    • Directories
    • Manufacturing Division
    • Store
    • Want More?
  • SIGN UP TODAY
    • Create Account
    • eMagazine
    • Newsletters
    • Customer Service
    • Manage Preferences
  • SERVICES
    • Marketing Services
    • Reprints
    • Market Research
    • List Rental
    • Survey/Respondent Access
  • STAY CONNECTED
    • LinkedIn
    • Facebook
    • Instagram
    • YouTube
    • X (Twitter)
  • PRIVACY
    • PRIVACY POLICY
    • TERMS & CONDITIONS
    • DO NOT SELL MY PERSONAL INFORMATION
    • PRIVACY REQUEST
    • ACCESSIBILITY

Copyright ©2026. All Rights Reserved BNP Media, Inc. and BNP Media II, LLC.

Design, CMS, Hosting & Web Development :: ePublishing