java.lang.Object

neqsim.process.ml.GymEnvironment

All Implemented Interfaces:: Serializable

Direct Known Subclasses:: SeparatorGymEnv

public abstract class GymEnvironment extends Object implements Serializable

Gymnasium (OpenAI Gym) compatible environment interface for NeqSim.

This class provides a standardized interface compatible with Python's Gymnasium library, enabling seamless integration with popular RL frameworks like stable-baselines3, RLlib, and CleanRL.

Python Usage via JPype:


import jpype
from jpype import JClass

GymEnvironment = JClass('neqsim.process.ml.GymEnvironment')
env = MySeparatorEnv()  # extends GymEnvironment

obs = env.reset()
for _ in range(1000):
    action = agent.predict(obs)
    obs, reward, terminated, truncated, info = env.step(action)
    if terminated or truncated:
        obs = env.reset()

Version:

1.0

Author:

ESOL

See Also:

Nested Class Summary

Nested Classes

Modifier and Type

Class

Description

static class

GymEnvironment.ResetResult

Reset result matching Gymnasium API.

static class

GymEnvironment.StepResult

Step result matching Gymnasium API.
Field Summary

Fields

Modifier and Type

Field

Description

protected int

actionDim

protected double[]

actionHigh

protected double[]

actionLow

Action space bounds.

protected int

currentStep

Episode state.

protected String

envId

Environment metadata.

protected double

episodeReward

protected int

maxEpisodeSteps

protected int

observationDim

protected double[]

observationHigh

protected double[]

observationLow

Observation space bounds.

protected double

rewardThreshold

private static final long

serialVersionUID

protected boolean

terminated

protected boolean

truncated
Constructor Summary

Constructors

Constructor

Description

GymEnvironment()
Method Summary

Modifier and Type

Method

Description

protected double[]

clipAction(double[] action)

Clip action to valid bounds.

int

getActionDim()

Get action space dimension.

double[]

getActionHigh()

Get action space upper bounds.

double[]

getActionLow()

Get action space lower bounds.

int

getCurrentStep()

Get current episode step.

String

getEnvId()

Get environment ID.

double

getEpisodeReward()

Get cumulative episode reward.

int

getMaxEpisodeSteps()

Get maximum episode steps.

int

getObservationDim()

Get observation space dimension.

double[]

getObservationHigh()

Get observation space upper bounds.

double[]

getObservationLow()

Get observation space lower bounds.

boolean

isDone()

Check if environment is done (terminated or truncated).

GymEnvironment.ResetResult

reset()

Reset the environment to initial state.

GymEnvironment.ResetResult

reset(Long seed, Map<String,Object> options)

Reset the environment with optional seed and options.

protected abstract double[]

resetInternal(Map<String,Object> options)

Internal reset implementation.

void

setMaxEpisodeSteps(int maxSteps)

Set maximum episode steps.

protected void

setSeed(long seed)

Set random seed for reproducibility.

GymEnvironment.StepResult

step(double[] action)

Take a step in the environment.

protected abstract GymEnvironment.StepResult

stepInternal(double[] action)

Internal step implementation.

Methods inherited from class Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait

Field Details
- serialVersionUID
  private static final long serialVersionUID
  
  See Also:
  
  Constant Field Values
- observationLow
  
  protected double[] observationLow
  
  Observation space bounds.
- observationHigh
  
  protected double[] observationHigh
- observationDim
  
  protected int observationDim
- actionLow
  
  protected double[] actionLow
  
  Action space bounds.
- actionHigh
  
  protected double[] actionHigh
- actionDim
  
  protected int actionDim
- envId
  
  protected String envId
  
  Environment metadata.
- maxEpisodeSteps
  
  protected int maxEpisodeSteps
- rewardThreshold
  
  protected double rewardThreshold
- currentStep
  
  protected int currentStep
  
  Episode state.
- episodeReward
  
  protected double episodeReward
- terminated
  
  protected boolean terminated
- truncated
  
  protected boolean truncated
Constructor Details
- GymEnvironment
  
  public GymEnvironment()
Method Details
- reset
  
  public GymEnvironment.ResetResult reset()
  
  Reset the environment to initial state.
  Gymnasium API: obs, info = env.reset()
  
  Returns:
  
  ResetResult with initial observation and info
- reset
  
  public GymEnvironment.ResetResult reset(Long seed, Map<String,Object> options)
  
  Reset the environment with optional seed and options.
  
  Parameters:
  
  seed - random seed for reproducibility (nullable)
  
  options - additional reset options (nullable)
  
  Returns:
  
  ResetResult with initial observation and info
- step
  
  public GymEnvironment.StepResult step(double[] action)
  
  Take a step in the environment.
  Gymnasium API: obs, reward, terminated, truncated, info = env.step(action)
  
  Parameters:
  
  action - action array (continuous values)
  
  Returns:
  
  StepResult with next observation, reward, termination flags, and info
- clipAction
  
  protected double[] clipAction(double[] action)
  
  Clip action to valid bounds.
  
  Parameters:
  
  action - raw action
  
  Returns:
  
  clipped action
- resetInternal
  
  protected abstract double[] resetInternal(Map<String,Object> options)
  
  Internal reset implementation. Override in subclass.
  
  Parameters:
  
  options - reset options
  
  Returns:
  
  initial observation
- stepInternal
  
  protected abstract GymEnvironment.StepResult stepInternal(double[] action)
  
  Internal step implementation. Override in subclass.
  
  Parameters:
  
  action - clipped action
  
  Returns:
  
  step result
- setSeed
  
  protected void setSeed(long seed)
  
  Set random seed for reproducibility.
  
  Parameters:
  
  seed - random seed
- getObservationDim
  
  public int getObservationDim()
  
  Get observation space dimension.
  
  Returns:
  
  observation dimension
- getActionDim
  
  public int getActionDim()
  
  Get action space dimension.
  
  Returns:
  
  action dimension
- getObservationLow
  
  public double[] getObservationLow()
  
  Get observation space lower bounds.
  
  Returns:
  
  lower bounds array
- getObservationHigh
  
  public double[] getObservationHigh()
  
  Get observation space upper bounds.
  
  Returns:
  
  upper bounds array
- getActionLow
  
  public double[] getActionLow()
  
  Get action space lower bounds.
  
  Returns:
  
  lower bounds array
- getActionHigh
  
  public double[] getActionHigh()
  
  Get action space upper bounds.
  
  Returns:
  
  upper bounds array
- getEnvId
  
  public String getEnvId()
  
  Get environment ID.
  
  Returns:
  
  environment identifier
- getMaxEpisodeSteps
  
  public int getMaxEpisodeSteps()
  
  Get maximum episode steps.
  
  Returns:
  
  max steps
- setMaxEpisodeSteps
  
  public void setMaxEpisodeSteps(int maxSteps)
  
  Set maximum episode steps.
  
  Parameters:
  
  maxSteps - max steps
- isDone
  
  public boolean isDone()
  
  Check if environment is done (terminated or truncated).
  
  Returns:
  
  true if episode ended
- getCurrentStep
  
  public int getCurrentStep()
  
  Get current episode step.
  
  Returns:
  
  step count
- getEpisodeReward
  
  public double getEpisodeReward()
  
  Get cumulative episode reward.
  
  Returns:
  
  total reward

Class GymEnvironment

Python Usage via JPype:

Nested Class Summary

Field Summary

Constructor Summary

Method Summary

Methods inherited from class Object

Field Details

serialVersionUID

observationLow

observationHigh

observationDim

actionLow

actionHigh

actionDim

envId

maxEpisodeSteps

rewardThreshold

currentStep

episodeReward

terminated

truncated

Constructor Details

GymEnvironment

Method Details

reset

reset

step

clipAction

resetInternal

stepInternal

setSeed

getObservationDim

getActionDim

getObservationLow

getObservationHigh

getActionLow

getActionHigh

getEnvId

getMaxEpisodeSteps

setMaxEpisodeSteps

isDone

getCurrentStep

getEpisodeReward