TensorStore for High-Performance, Scalable Array Storage

Posted by Jeremy Maitin-Shepard and Laramie Leavitt, Software Engineers, Connectomics at Google

Many exciting contemporary applications of computer science and machine learning (ML) manipulate multidimensional datasets that span a single large coordinate system, for example, weather modeling from atmospheric measurements over a spatial grid or medical imaging predictions from

Shared by Google AI Technology September 22, 2022

View Synthesis with Transformers

Posted by Carlos Esteves and Ameesh Makadia, Research Scientists, Google Research

A long-standing problem at the intersection of computer vision and computer graphics, view synthesis is the task of creating new views of a scene from multiple pictures of that scene. This has received increased attention [1, 2, 3]

Shared by Google AI Technology September 21, 2022

FindIt: Generalized Object Localization with Natural Language Queries

Posted by Weicheng Kuo and Anelia Angelova, Research Scientists, Google Research, Brain Team

Natural language enables flexible descriptive queries about images. The interaction between text queries and images grounds linguistic meaning in the visual world, facilitating a better understanding of object relationships, human intentions towards objects, and interactions with

Shared by Google AI Technology September 20, 2022

Google at Interspeech 2022

Posted by Cat Armato, Program Manager, Google

This week, the 23rd Annual Conference of the International Speech Communication Association (INTERSPEECH 2022) is being held in Incheon, South Korea, representing one of the world’s most extensive conferences on research and technology of spoken language understanding and processing. Over 2,000 experts

Shared by Google AI Technology September 17, 2022

Robust Online Allocation with Dual Mirror Descent

Posted by Santiago Balseiro, Staff Research Scientist, Google Research, and Associate Professor at Columbia University, and Vahab Mirrokni, Distinguished Scientist, Google Research

The emergence of digital technologies has transformed decision making across commercial sectors such as airlines, online retailing, and internet advertising. Today, real-time decisions need to be repeatedly

Shared by Google AI Technology September 16, 2022

PaLI: Scaling Language-Image Learning in 100+ Languages

Posted by Xi Chen and Xiao Wang, Software Engineers, Google Research

Advanced language models (e.g., GPT, GLaM, PaLM and T5) have demonstrated diverse capabilities and achieved impressive results across tasks and languages by scaling up their number of parameters. Vision-language (VL) models can benefit from similar scaling to address

Shared by Google AI Technology September 15, 2022

LOLNeRF: Learn from One Look

Posted by Daniel Rebain, Student Researcher, and Mark Matthews, Senior Software Engineer, Google Research, Perception Team

An important aspect of human vision is our ability to comprehend 3D shape from the 2D images we observe. Achieving this kind of understanding with computer vision systems has been a fundamental challenge

Shared by Google AI Technology September 13, 2022

Learning to Walk in the Wild from Terrain Semantics

Posted by Yuxiang Yang, Student Researcher, Robotics at Google

An important promise for quadrupedal robots is their potential to operate in complex outdoor environments that are difficult or inaccessible for humans. Whether it’s to find natural resources deep in the mountains, or to search for life signals in heavily-damaged

Shared by Google AI Technology September 9, 2022

A Multi-Axis Approach for Vision Transformer and MLP Models

Posted by Zhengzhong Tu and Yinxiao Li, Software Engineers, Google Research

Convolutional neural networks have been the dominant machine learning architecture for computer vision since the introduction of AlexNet in 2012. Recently, inspired by the evolution of Transformers in natural language processing, attention mechanisms have been prominently incorporated into

Shared by Google AI Technology September 8, 2022