Warehouse Automation

View Original

Meta AI unveils new AI tool to identify objects in images and videos

Meta's image segmentation model could have a lot of applications, including in AR and VR.


When it comes to generative AI news, Meta has taken somewhat of a backseat as of late. However, on Wednesday, the tech giant unveiled research on image segmentation that could have a large impact on the future of AI. 

Image segmentation simply refers to an AI model that can identify different items in a photo. For example, in a photo of a box of fruit, using image segmentation, an AI model would be able to identify each individual fruit photographed and the box, as seen by the Meta demo.

Meta's Segment Anything project includes a new task, dataset, and model for image segmentation that aims to "democratize segmentation," according to Meta. 

Meta released both its general Segment Anything Model (SAM) and its largest ever segmentation dataset, Segment Anything 1-Billion mask dataset (SA-1B). The dataset has over 1 billion masks on 11 million licensed and privacy-respecting images.

Meta says it released (SAM) and (SA-1B) "to enable a wide range of applications and promote further research into foundation models for computer vision."

Image segmentation can be used for photo editing, scientific imagery analysis, larger AI systems that require a general multimodal understanding of the world, and most interestingly, AR and VR. 


Previous segmentation models either required an individual to guide it through interactive segmentation or training based on substantial amounts of manually annotated objects for automatic segmentation.

SAM is a single model that can easily perform either segmentation method. It means practitioners no longer have to collect their own segmentation data and also eliminates the need to fine-tune a model for their use case, saving both time and effort. 

You can test out the technology by visiting the Segment Anything demo site and uploading your own image or utilizing a photo in the gallery. 

Link to full story on Meta AI >


See this gallery in the original post