Deepmind – Page 15

From LEGO competitions to DeepMind’s robotics lab

May 19, 2022

by Deepmind

If you want to be at DeepMind, go for it. Apply, interview, and just try. You might not get it the first time but that doesn’t mean you can’t try again. I never thought DeepMind would accept me, and when they did, I thought it was a mistake. Everyone doubts themselves – I’ve never felt like the smartest person in the room. I’ve often felt the opposite. But I’ve learned that, despite those feelings, I do belong and I do deserve to work at a place like this. And that journey, for me, started with just trying.Read More

From LEGO competitions to DeepMind’s robotics lab

May 19, 2022

by Deepmind

Emergent Bartering Behaviour in Multi-Agent Reinforcement Learning

May 16, 2022

by Deepmind

In our recent paper, we explore how populations of deep reinforcement learning (deep RL) agents can learn microeconomic behaviours, such as production, consumption, and trading of goods. We find that artificial agents learn to make economically rational decisions about production, consumption, and prices, and react appropriately to supply and demand changes.Read More

Emergent Bartering Behaviour in Multi-Agent Reinforcement Learning

May 16, 2022

by Deepmind

A Generalist Agent

May 12, 2022

by Deepmind

Inspired by progress in large-scale language modelling, we apply a similar approach towards building a single generalist agent beyond the realm of text outputs. The agent, which we refer to as Gato, works as a multi-modal, multi-task, multi-embodiment generalist policy. The same network with the same weights can play Atari, caption images, chat, stack blocks with a real robot arm and much more, deciding based on its context whether to output text, joint torques, button presses, or other tokens.Read More

A Generalist Agent

May 12, 2022

by Deepmind

Active offline policy selection

May 6, 2022

by Deepmind

To make RL more applicable to real-world applications like robotics, we propose using an intelligent evaluation procedure to select the policy for deployment, called active offline policy selection (A-OPS). In A-OPS, we make use of the prerecorded dataset and allow limited interactions with the real environment to boost the selection quality.Read More

Active offline policy selection

May 6, 2022

by Deepmind

Tackling multiple tasks with a single visual language model

April 28, 2022

by Deepmind

We introduce Flamingo, a single visual language model (VLM) that sets a new state of the art in few-shot learning on a wide range of open-ended multimodal tasks.Read More

When a passion for bass and brass help build better tools

April 28, 2022

by Deepmind

We caught up with Kevin Millikin, a software engineer on the DevTools team. He’s in Salt Lake City this week to present at PyCon US, the largest annual gathering for those using and developing the open-source Python programming language.Read More

Vedere AI

Posts in category: Deepmind

From LEGO competitions to DeepMind’s robotics lab

From LEGO competitions to DeepMind’s robotics lab

Emergent Bartering Behaviour in Multi-Agent Reinforcement Learning

Emergent Bartering Behaviour in Multi-Agent Reinforcement Learning

A Generalist Agent

A Generalist Agent

Active offline policy selection

Active offline policy selection

Tackling multiple tasks with a single visual language model

When a passion for bass and brass help build better tools

Navigation

Computer Vision Endless Possibilities

"I'm interested in things that change the world or that affect the future and wondrous, new technology where you see it, and you're like, 'Wow, how did that even happen? How is that possible?'" -- Elon Musk

Copyright © 2019-2023 Vedere AI. All Rights Reserved.