If you are searching for literature use the terms "economic dispatch" and "security constrained economic dispatch". Yes there are papers behind firewalls, but there are numerous textbooks that are not.
In the old days, the optimum solution was simple, we simple dispatched according to equal incremental costs. That was simple and effective. Today, we optimize systems of linear programming using the
Simplex Algorithm. Custom software sets up the equations (on the order of 250000 simultaneous equations and constraints) and commercial solver engines calculate the optimum.
The full problem includes multiple optimum solutions looking forward in time different periods. For example, thermal power plants need to know if they will be needed tomorrow so that the appropriate staff can be told to come to work. Getting them started, warmed up and ready takes 4-18 hours. Therefore, we make one optimum plan for a whole day 24 hours in advance. Then in the actual day, once every hour we make a new optimum solution for the deviations between reality and the day-ahead solution. Then every 5 minutes we make another optimum solution for the deviations between reality and the hour-ahead solution. The actual closed-loop control uses the 5 minute optimum solution, but feedbacks and control actions are based on frequency and power flows only; no more economics.
In all those cases, the actual capabilities of each component must be modeled including startup/shutdown times, ramp rates, weather (for solar/wind), and capacity limits (batteries, flywheels, hydro). In addition, the capacity of the transmission system to securely transport power from every point A to every other point B is modeled. In some cases, laws and regulations must also be modeled.
Of course, for the purposes of learning you can start much simpler. But I would describe it as an optimization problem, not optimal control.
Good luck
You may enjoy this PF Insights article.
https://www.physicsforums.com/insights/what-happens-when-you-flip-the-light-switch/