Class RLEnvironment
java.lang.Object
neqsim.process.ml.RLEnvironment
- All Implemented Interfaces:
Serializable
- Direct Known Subclasses:
SeparatorLevelControlEnv
Reinforcement Learning environment wrapper for NeqSim process systems.
Provides a Gym-compatible interface for RL training on process control tasks. Key features:
- Standardized observation and action spaces
- Physics-grounded reward computation
- Safe action projection via constraint manager
- Episode management with reset capability
Usage Example:

  ProcessSystem process = new ProcessSystem();
  // ... build process ...
  RLEnvironment env = new RLEnvironment(process);
  env.addControlledEquipment("valve1", valve, actionSpace);
  // Weights: energy, setpointError, constraintViolation, throughput
  env.setRewardWeights(1.0, 10.0, 100.0, 1.0);

  StateVector obs = env.reset();
  boolean done = false;
  while (!done) {
    ActionVector action = agent.selectAction(obs); // agent is the RL policy
    StepResult result = env.step(action);
    obs = result.observation;
    done = result.done;
  }
- Version:
- 1.0
- Author:
- ESOL
Nested Class Summary
Nested Classes
- static class RLEnvironment.StepInfo
  Additional info from a step.
- static class RLEnvironment.StepResult
  Result of a simulation step.
Field Summary
Fields
- private final ActionVector actionSpace
- private final ConstraintManager constraintManager
- private double currentTime
- private boolean done
- private double maxEpisodeTime
- private final ProcessSystem process
- private static final long serialVersionUID
- private double simulationTimeStep
- private int stepCount
- private double weightConstraintViolation
- private double weightEnergy
- private double weightSetpointError
- private double weightThroughput
Constructor Summary
Constructors
- RLEnvironment(ProcessSystem process)
  Create an RL environment wrapping a process system.
Method Summary
Methods
- public RLEnvironment addConstraint(String name, String variableName, double minValue, double maxValue, String unit)
  Add a hard constraint.
- protected void applyAction(ActionVector action)
  Apply action to process equipment.
- protected double computeReward(StateVector state, ActionVector action, RLEnvironment.StepInfo info)
  Compute reward for current state and action.
- public RLEnvironment defineAction(String name, double lowerBound, double upperBound, String unit)
  Define an action dimension.
- public ActionVector getActionSpace()
  Get the action space specification.
- public ConstraintManager getConstraintManager()
  Get the constraint manager.
- public double getCurrentTime()
  Get current simulation time.
- protected StateVector getObservation()
  Get current observation.
- public ProcessSystem getProcess()
  Get the underlying process system.
- public int getStepCount()
  Get step count in current episode.
- public boolean isDone()
  Check if episode is done.
- public StateVector reset()
  Reset the environment to initial state.
- public RLEnvironment setMaxEpisodeTime(double maxTime)
  Set maximum episode time.
- public RLEnvironment setRewardWeights(double energy, double setpointError, double constraintViolation, double throughput)
  Set reward weights.
- public RLEnvironment setTimeStep(double dt)
  Set simulation time step.
- public RLEnvironment.StepResult step(ActionVector action)
  Execute one simulation step with given action.
Field Details
- serialVersionUID
  private static final long serialVersionUID
  - See Also:
    - Constant Field Values
- process
  private final ProcessSystem process
- constraintManager
  private final ConstraintManager constraintManager
- actionSpace
  private final ActionVector actionSpace
- simulationTimeStep
  private double simulationTimeStep
- currentTime
  private double currentTime
- maxEpisodeTime
  private double maxEpisodeTime
- weightEnergy
  private double weightEnergy
- weightSetpointError
  private double weightSetpointError
- weightConstraintViolation
  private double weightConstraintViolation
- weightThroughput
  private double weightThroughput
- done
  private boolean done
- stepCount
  private int stepCount
Constructor Details
- RLEnvironment
  public RLEnvironment(ProcessSystem process)
  Create an RL environment wrapping a process system.
  - Parameters:
    - process - the process system to control
Method Details
- defineAction
  public RLEnvironment defineAction(String name, double lowerBound, double upperBound, String unit)
  Define an action dimension.
  - Parameters:
    - name - action name
    - lowerBound - minimum value
    - upperBound - maximum value
    - unit - physical unit
  - Returns:
    - this environment for chaining
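  Example: a minimal sketch of chaining defineAction calls to build a two-dimensional action space; the action names, bounds, and units are hypothetical:

    env.defineAction("valveOpening", 0.0, 1.0, "fraction")
       .defineAction("compressorSpeed", 2000.0, 5000.0, "rpm");

  Because each call returns the environment itself, the full action space can be declared fluently before the first reset().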
- addConstraint
  public RLEnvironment addConstraint(String name, String variableName, double minValue, double maxValue, String unit)
  Add a hard constraint.
  - Parameters:
    - name - constraint name
    - variableName - state variable to constrain
    - minValue - minimum allowed
    - maxValue - maximum allowed
    - unit - physical unit
  - Returns:
    - this environment for chaining
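  Example: a sketch of bounding a state variable; the variable name and limits are hypothetical. Per the class-level "safe action projection" feature, the constraint manager is expected to keep actions within such bounds:

    env.addConstraint("levelLimit", "separatorLevel", 0.1, 0.9, "fraction");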
- setRewardWeights
  public RLEnvironment setRewardWeights(double energy, double setpointError, double constraintViolation, double throughput)
  Set reward weights.
  - Parameters:
    - energy - weight for energy consumption (negative reward)
    - setpointError - weight for setpoint deviation (negative reward)
    - constraintViolation - weight for constraint violations (negative reward)
    - throughput - weight for production throughput (positive reward)
  - Returns:
    - this environment for chaining
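  Example: a configuration sketch with illustrative magnitudes (not recommended values). Per the parameter docs, the first three weights scale penalty terms and the last scales a production bonus:

    env.setRewardWeights(
        0.01,   // energy: penalty per unit of energy consumed
        10.0,   // setpointError: penalty for deviation from setpoint
        100.0,  // constraintViolation: large penalty for breaching hard constraints
        1.0);   // throughput: bonus for production throughput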
- setTimeStep
  public RLEnvironment setTimeStep(double dt)
  Set simulation time step.
  - Parameters:
    - dt - time step in seconds
  - Returns:
    - this environment for chaining
- setMaxEpisodeTime
  public RLEnvironment setMaxEpisodeTime(double maxTime)
  Set maximum episode time.
  - Parameters:
    - maxTime - maximum time in seconds
  - Returns:
    - this environment for chaining
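  Example: together with setTimeStep, this bounds episode length. With the illustrative values below, a 10 s step and a 3600 s horizon give at most 360 steps per episode:

    env.setTimeStep(10.0)          // advance the simulation 10 s per step()
       .setMaxEpisodeTime(3600.0); // end the episode after one simulated hour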
- reset
  public StateVector reset()
  Reset the environment to initial state.
  - Returns:
    - initial observation
- step
  public RLEnvironment.StepResult step(ActionVector action)
  Execute one simulation step with given action.
  - Parameters:
    - action - control action to apply
  - Returns:
    - step result with observation, reward, done flag
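  Example: a sketch of consuming the step result. The observation and done fields appear in the class usage example; a reward field is implied by the Returns description, but its name here is an assumption:

    StepResult result = env.step(action);
    obs = result.observation;       // next state vector
    double reward = result.reward;  // assumed field holding the scalar reward
    if (result.done) {
      obs = env.reset();            // episode finished; start a new one
    }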
- applyAction
  protected void applyAction(ActionVector action)
  Apply action to process equipment. Override in subclass to implement specific control logic.
  - Parameters:
    - action - the action to apply
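  Example: a minimal subclass override sketch, assuming the first action dimension drives a throttling valve. The getValue accessor, the valve field, and the valve setter are assumptions for illustration; check the actual ActionVector and valve APIs:

    @Override
    protected void applyAction(ActionVector action) {
      // Assumed accessor: first action dimension is the valve opening (0-1).
      double opening = action.getValue(0);
      valve.setPercentValveOpening(opening * 100.0); // scale to percent
    }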
- getObservation
  protected StateVector getObservation()
  Get current observation. Override in subclass to include equipment-specific states.
  - Returns:
    - current state vector
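  Example: an override sketch that extends the base observation with an equipment-specific state. The StateVector add method and the separator accessor are assumptions to be adapted to the real APIs:

    @Override
    protected StateVector getObservation() {
      StateVector obs = super.getObservation();
      // Assumed API: append the separator liquid level to the observation.
      obs.add("separatorLevel", separator.getLiquidLevel());
      return obs;
    }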
- computeReward
  protected double computeReward(StateVector state, ActionVector action, RLEnvironment.StepInfo info)
  Compute reward for current state and action.
  - Parameters:
    - state - current state
    - action - applied action
    - info - info object to fill with details
  - Returns:
    - scalar reward
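  Example: an override sketch that keeps the weighted base reward and adds a shaping term penalizing abrupt control moves. previousAction is a hypothetical subclass field and getValue an assumed accessor:

    @Override
    protected double computeReward(StateVector state, ActionVector action,
        RLEnvironment.StepInfo info) {
      double reward = super.computeReward(state, action, info);
      // Shaping: small penalty on large changes in the first action dimension.
      reward -= 0.1 * Math.abs(action.getValue(0) - previousAction);
      previousAction = action.getValue(0);
      return reward;
    }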
- getActionSpace
  public ActionVector getActionSpace()
  Get the action space specification.
  - Returns:
    - the action space specification
- getConstraintManager
  public ConstraintManager getConstraintManager()
  Get the constraint manager.
  - Returns:
    - constraint manager
- getProcess
  public ProcessSystem getProcess()
  Get the underlying process system.
  - Returns:
    - the process system
- getCurrentTime
  public double getCurrentTime()
  Get current simulation time.
  - Returns:
    - time in seconds
- getStepCount
  public int getStepCount()
  Get step count in current episode.
  - Returns:
    - number of steps taken
- isDone
  public boolean isDone()
  Check if episode is done.
  - Returns:
    - true if episode finished