Revisiting Mask Transformer from a Clustering Perspective

Favorite Posted by Qihang Yu, Student Researcher, and Liang-Chieh Chen, Research Scientist, Google Research Panoptic segmentation is a computer vision problem that serves as a core task for many real-world applications. Due to its complexity, previous work often divides panoptic segmentation into semantic segmentation (assigning semantic labels, such as “person”

Read More
Shared by Google AI Technology July 12, 2022

The 3 most dangerous words in Knowledge Management

Favorite There are three dangerous words you hear a lot when introducing KM. Here’s how to respond to them. Image from publicdomainpictures.net “We are different” Those are the three words, and they usually appear in this context. “Yes, I hear your stories and case histories about how KM adds value,

Read More
Shared by Nick Milton July 11, 2022

Onboard PaddleOCR with Amazon SageMaker Projects for MLOps to perform optical character recognition on identity documents

Favorite Optical character recognition (OCR) is the task of converting printed or handwritten text into machine-encoded text. OCR has been widely used in various scenarios, such as document electronization and identity authentication. Because OCR can greatly reduce the manual effort to register key information and serve as an entry step

Read More
Shared by AWS Machine Learning July 8, 2022

​​Deep Hierarchical Planning from Pixels

Favorite Posted by Danijar Hafner, Student Researcher, Google Research Research into how artificial agents can make decisions has evolved rapidly through advances in deep reinforcement learning. Compared to generative ML models like GPT-3 and Imagen, artificial agents can directly influence their environment through actions, such as moving a robot arm

Read More
Shared by Google AI Technology July 8, 2022

Enabling Creative Expression with Concept Activation Vectors

Favorite Posted by Been Kim, Research Scientist, Google Research, Brain Team, and Alison Lentz, Senior Staff Strategist, Google Research, Mural Team Advances in computer vision and natural language processing continue to unlock new ways of exploring billions of images available on public and searchable websites. Today’s visual search tools make

Read More
Shared by Google AI Technology July 7, 2022

An update on our work in responsible innovation

Favorite Over the last year, we’ve seen artificial intelligence (AI) systems advance our work in areas like inclusive product development and support for small businesses and job seekers. We’ve also seen its potential to be helpful in addressing major global needs — like forecasting and planning humanitarian responses to natural

Read More
Shared by Google AI Technology July 6, 2022