Environment¶

Environments define learning problems. They can be used to execute behaviors and measure their performance.

Environment¶

The following table gives an overview of the environments that are provided by BOLeRo.

Environment name	Usecase
Objective Function	Benchmark functions for black-box optimization
OpenAI Gym	Benchmark problems for reinforcement learning
Optimum Trajectory	Optimize a trajectory to avoid obstacles and minimize the effort

Objective Function¶

Several benchmark functions to compare continuous black-box optimizers are provided by the environment ObjectiveFunction. The objective functions are the same as in the software COCO.

Examples using `bolero.environment.ObjectiveFunction`¶

Objective Function

OpenAI Gym¶

The environment OpenAiGym is a wrapper for OpenAI Gym environments.

Examples using `bolero.environment.OpenAiGym`¶

Discrete Problem

Cart Pole

Optimum Trajectory¶

The environment OptimumTrajectory is designed to use behavior learning algorithms for simple planning problems. The goal is to avoid obstacles and minimize the effort used for the trajectory, e.g. by minimizing the velocities or accelerations.

Examples using `bolero.environment.OptimumTrajectory`¶

Obstacle Avoidance

Contextual Environment¶

The following table gives an overview of the contextual environments that are provided by BOLeRo.

Environment name	Usecase
Contextual Objective Function	Contextual benchmark functions for contextual black-box optimization
Catapult	Benchmark problem for contextual policy search

Contextual Objective Function¶

Several contextual benchmark functions to compare continuous, contextual black-box optimizers are provided by the environment ContextualObjectiveFunction. The contextual objective functions are based on the functions that are provided with COCO.

Catapult¶

The Catapult environment is a benchmark problem for contextual policy search. It is a two-dimensional environment like the one displayed in the figur below. The goal is to hit the ground at a target specified on the x-axis. The target is given by the context vector.

Examples using `bolero.environment.Catapult`¶

Catapult Environment

Environment¶

Environment¶

Objective Function¶

Examples using bolero.environment.ObjectiveFunction¶

OpenAI Gym¶

Examples using bolero.environment.OpenAiGym¶

Optimum Trajectory¶

Examples using bolero.environment.OptimumTrajectory¶

Contextual Environment¶

Contextual Objective Function¶

Catapult¶

Examples using bolero.environment.Catapult¶

Examples using `bolero.environment.ObjectiveFunction`¶

Examples using `bolero.environment.OpenAiGym`¶

Examples using `bolero.environment.OptimumTrajectory`¶

Examples using `bolero.environment.Catapult`¶