Visual captions: Using large language models to augment video conferences with dynamic visuals

Favorite Posted by Ruofei Du, Research Scientist, and Alex Olwal, Senior Staff Research Scientist, Google Augmented Reality Recent advances in video conferencing have significantly improved remote video communication through features like live captioning and noise cancellation. However, there are various situations where dynamic visual augmentation would be useful to better

Read More
Shared by Google AI Technology June 6, 2023

Retrieval-augmented visual-language pre-training

Favorite Posted by Ziniu Hu, Student Researcher, and Alireza Fathi, Research Scientist, Google Research, Perception Team Large-scale models, such as T5, GPT-3, PaLM, Flamingo and PaLI, have demonstrated the ability to store substantial amounts of knowledge when scaled to tens of billions of parameters and trained on large text and

Read More
Shared by Google AI Technology June 1, 2023

Large sequence models for software development activities

Favorite Posted by Petros Maniatis and Daniel Tarlow, Research Scientists, Google Software isn’t created in one dramatic step. It improves bit by bit, one little step at a time — editing, running unit tests, fixing build errors, addressing code reviews, editing some more, appeasing linters, and fixing more errors —

Read More
Shared by Google AI Technology May 31, 2023

Foundation models for reasoning on charts

Favorite Posted by Julian Eisenschlos, Research Software Engineer, Google Research Visual language is the form of communication that relies on pictorial symbols outside of text to convey information. It is ubiquitous in our digital life in the form of iconography, infographics, tables, plots, and charts, extending to the real world

Read More
Shared by Google AI Technology May 26, 2023

Differentially private clustering for large-scale datasets

Favorite Posted by Vincent Cohen-Addad and Alessandro Epasto, Research Scientists, Google Research, Graph Mining team Clustering is a central problem in unsupervised machine learning (ML) with many applications across domains in both industry and academic research more broadly. At its core, clustering consists of the following problem: given a set

Read More
Shared by Google AI Technology May 25, 2023

Google Research at I/O 2023

Favorite Posted by James Manyika, SVP Google Research and Technology & Society, and Jeff Dean, Chief Scientist, Google DeepMind and Google Research Wednesday, May 10th was an exciting day for the Google Research community as we watched the results of months and years of our foundational and applied work get

Read More
Shared by Google AI Technology May 25, 2023

3 new ways generative AI can help you search

Favorite Today, we’re starting to open up access to SGE (Search Generative Experience), one of our first experiments in Search Labs. View Original Source (blog.google/technology/ai/) Here.