Joni Pajarinen

research

∙ 09/19/2023

Monte-Carlo tree search with uncertainty propagation via optimal transport

This paper introduces a novel backup strategy for Monte-Carlo Tree Searc...

0 Tuan Dam, et al. ∙

research

∙ 09/05/2023

Sparse Function-space Representation of Neural Networks

Deep neural networks (NNs) are known to lack uncertainty estimates and s...

0 Aidan Scannell, et al. ∙

research

∙ 09/01/2023

Suicidal Pedestrian: Generation of Safety-Critical Scenarios for Autonomous Vehicles

Developing reliable autonomous driving algorithms poses challenges in te...

0 Yuhang Yang, et al. ∙

research

∙ 06/15/2023

Simplified Temporal Consistency Reinforcement Learning

Reinforcement learning is able to solve complex sequential decision-maki...

0 Yi Zhao, et al. ∙

research

∙ 03/05/2023

Seq2Seq Imitation Learning for Tactile Feedback-based Manipulation

Robot control for tactile feedback-based manipulation can be difficult d...

0 Wenyan Yang, et al. ∙

research

∙ 02/17/2023

Swapped goal-conditioned offline reinforcement learning

Offline goal-conditioned reinforcement learning (GCRL) can be challengin...

0 Wenyan Yang, et al. ∙

research

∙ 02/15/2023

Prioritized offline Goal-swapping Experience Replay

In goal-conditioned offline reinforcement learning, an agent learns from...

0 Wenyan Yang, et al. ∙

research

∙ 01/30/2023

Hierarchical Imitation Learning with Vector Quantized Models

The ability to plan actions on multiple levels of abstraction enables in...

0 Kalle Kujanpää, et al. ∙

research

∙ 11/14/2022

Redeeming Intrinsic Rewards via Constrained Optimization

State-of-the-art reinforcement learning (RL) algorithms typically use ra...

5 Eric Chen, et al. ∙

research

∙ 10/25/2022

Adaptive Behavior Cloning Regularization for Stable Offline-to-Online Reinforcement Learning

Offline reinforcement learning, by learning from a fixed dataset, makes ...

0 Yi Zhao, et al. ∙

research

∙ 10/04/2022

Continuous Monte Carlo Graph Search

In many complex sequential decision making tasks, online planning is cru...

0 Amin Babadi, et al. ∙

research

∙ 09/21/2022

Partially Observable Markov Decision Processes in Robotics: A Survey

Noisy sensing, imperfect control, and environment changes are defining c...

0 Mikko Lauri, et al. ∙

research

∙ 05/20/2022

Self-Paced Multi-Agent Reinforcement Learning

Curriculum reinforcement learning (CRL) aims to speed up learning of a t...

16 Wenshuai Zhao, et al. ∙

research

∙ 03/29/2022

Topological Experience Replay

State-of-the-art deep Q-learning methods update Q-values using state tra...

7 Zhang-Wei Hong, et al. ∙

research

∙ 02/28/2022

GPU-Accelerated Policy Optimization via Batch Automatic Differentiation of Gaussian Processes for Real-World Control

The ability of Gaussian processes (GPs) to predict the behavior of dynam...

0 Abdolreza Taheri, et al. ∙

research

∙ 02/11/2022

A Unified Perspective on Value Backup and Exploration in Monte-Carlo Tree Search

Monte-Carlo Tree Search (MCTS) is a class of methods for solving complex...

0 Tuan Dam, et al. ∙

research

∙ 01/24/2022

Adversarially Guided Subgoal Generation for Hierarchical Reinforcement Learning

Hierarchical reinforcement learning (HRL) proposes to solve difficult ta...

0 Vivienne Huiling Wang, et al. ∙

research

∙ 04/22/2021

Reinforcement Learning using Guided Observability

Due to recent breakthroughs, reinforcement learning (RL) has demonstrate...

0 Stephan Weigand, et al. ∙

research

∙ 03/23/2021

Neural Network Controller for Autonomous Pile Loading Revised

We have recently proposed two pile loading controllers that learn from h...

0 Wenyan Yang, et al. ∙

research

∙ 02/25/2021

A Probabilistic Interpretation of Self-Paced Learning with Applications to Reinforcement Learning

Across machine learning, the use of curricula has shown strong empirical...

21 Pascal Klink, et al. ∙

research

∙ 10/26/2020

POMDP Manipulation Planning under Object Composition Uncertainty

Manipulating unknown objects in a cluttered environment is difficult bec...

0 Joni Pajarinen, et al. ∙

research

∙ 09/04/2020

Technical Report: The Policy Graph Improvement Algorithm

Optimizing a partially observable Markov decision process (POMDP) policy...

0 Joni Pajarinen, et al. ∙

research

∙ 07/04/2020

Multi-Sensor Next-Best-View Planning as Matroid-Constrained Submodular Maximization

3D scene models are useful in robotics for tasks such as path planning, ...

2 Mikko Lauri, et al. ∙

research

∙ 07/01/2020

Convex Regularization in Monte-Carlo Tree Search

Monte-Carlo planning and Reinforcement Learning (RL) are essential to se...

0 Tuan Dam, et al. ∙

research

∙ 04/27/2020

Machine Learning Based Mobile Network Throughput Classification

Identifying mobile network problems in 4G cells is more challenging when...

1 Lauri Alho, et al. ∙

research

∙ 04/24/2020

Self-Paced Deep Reinforcement Learning

Generalization and reuse of agent behaviour across a variety of learning...

0 Pascal Klink, et al. ∙

research

∙ 03/08/2020

Deep Adversarial Reinforcement Learning for Object Disentangling

Deep learning in combination with improved training techniques and high ...

0 Melvin Laux, et al. ∙

research

∙ 02/26/2020

Probabilistic approach to physical object disentangling

Physically disentangling entangled objects from each other is a problem ...

0 Joni Pajarinen, et al. ∙

research

∙ 01/01/2020

Long-Term Visitation Value for Deep Exploration in Sparse Reward Reinforcement Learning

Reinforcement learning with sparse rewards is still an open challenge. C...

7 Simone Parisi, et al. ∙

research

∙ 11/01/2019

Generalized Mean Estimation in Monte-Carlo Tree Search

We consider Monte-Carlo Tree Search (MCTS) applied to Markov Decision Pr...

0 Tuan Dam, et al. ∙

research

∙ 08/15/2019

Model-based Lookahead Reinforcement Learning

Model-based Reinforcement Learning (MBRL) allows data-efficient learning...

0 Zhang-Wei Hong, et al. ∙

research

∙ 02/26/2019

Information Gathering in Decentralized POMDPs by Policy Graph Improvement

Decentralized policies for information gathering are required when multi...

0 Mikko Lauri, et al. ∙

research

∙ 02/07/2019

Compatible Natural Gradient Policy Search

Trust-region methods have yielded state-of-the-art results in policy sea...

0 Joni Pajarinen, et al. ∙

research

∙ 11/16/2018

An Algorithmic Perspective on Imitation Learning

As robots and other intelligent agents move from simple environments and...

0 Takayuki Osa, et al. ∙
