How to Evaluate Model Performance for Simulated Power Demand?

  • Context: Undergrad 
  • Thread starter Thread starter bradyj7
  • Start date Start date
  • Tags Tags
    Error Model
Click For Summary

Discussion Overview

The discussion focuses on evaluating the performance of a model that simulates power demand from a fleet of electric cars over a 24-hour period. Participants explore methods for quantifying model accuracy and the implications of various statistical measures.

Discussion Character

  • Technical explanation
  • Mathematical reasoning
  • Debate/contested

Main Points Raised

  • One participant seeks advice on how to quantify the performance of their power demand simulation model, questioning whether to use RMSE or another metric due to daily variations in actual demand.
  • Another participant suggests considering the mean squared deviation between the actual demand (blue line) and the simulated demands (black lines) as a time-dependent function.
  • A participant clarifies that the blue line represents the average power demand, while the black lines are generated by their model, prompting a discussion about the reliability of the model's output.
  • Questions arise about the equivalence of mean squared deviation and mean square error, with participants discussing the computation of differences between the blue and black lines.
  • One participant proposes averaging the mean squared deviations over the 100 black lines to assess overall model performance.
  • Another participant emphasizes the importance of checking for time-dependence in the results and suggests that this can be visualized in a graph.

Areas of Agreement / Disagreement

Participants express differing views on the best approach to quantify model performance, particularly regarding the use of mean squared deviation versus mean square error, and whether to average over multiple simulations. The discussion remains unresolved with multiple competing perspectives on methodology.

Contextual Notes

Participants note the importance of understanding time-dependence in the model's performance metrics, but the specifics of how to assess this remain unclear. There is also uncertainty about the implications of computed values for model accuracy.

bradyj7
Messages
117
Reaction score
0
How to quantify performance of a model

Hello,

I was hoping that somebody could advise me how to do evaluate the performance of a model.

The model simulates the power demand from a fleet of electric cars over 24 hours.

For example take the blue line as the actual demand for a given day and the black lines (100) are simulated demands.

https://dl.dropbox.com/u/54057365/All/demand.JPG

Could anybody advise me on how I could quantify the performance of a the model? For example it varies by 5%, 10% etc? Would you use RMSE?

I'm not sure if error is the correct word because the blue line varies from day to day depending which day you pick.

Thank you
 
Last edited by a moderator:
Physics news on Phys.org
What do you want to do with your performance value?
How do you simulate those 100 curves?
I think the blue curve is an average over many cars?

To start, you could consider the mean squared deviation between blue line and simulations as time-dependent function.
 
Hi,

Thanks for your reply.

The black curve were generated by a model that I have made to predict the power demands of the cars over the day. The model uses data collect by cars.

The blue is the actual average power demand used by the cars. I create it my averaging the power demand of all the days in the dataset.

I want a performance value to quote how accurate or not the model is in general. Is it capable of generating reliable power demand profiles? How reliable?

Is mean squared deviation the same as mean square error?

http://en.wikipedia.org/wiki/Mean_squared_error

Is it essentially computing the difference between the blue line and the black line?

Would it be a good idea to compute the mean squared deviation between the blue line and the 100 black lines and then take the average mean squared deviation?

I computed the mse between the blue and 1 blak line ans an example in this workbook. I got an answer of 309. But what does this tell me about it accuracy?

https://dl.dropbox.com/u/54057365/All/MSE.xlsx

Thanks
 
Last edited by a moderator:
Is mean squared deviation the same as mean square error?
If that deviation is the error of your simulation, right.
Is it essentially computing the difference between the blue line and the black line?
The average difference.
Would it be a good idea to compute the mean squared deviation between the blue line and the 100 black lines and then take the average mean squared deviation?
I would average over the 100 black lines first, and check the time-dependence. If that is flat and you don't care about the deviation at specific times, you can average over all.

I computed the mse between the blue and 1 blak line ans an example in this workbook. I got an answer of 309. But what does this tell me about it accuracy?
Lower=better. sqrt(309)≈17 - on average, your black line is wrong by 17 kW.
 
I would average over the 100 black lines first, and check the time-dependence. If that is flat and you don't care about the deviation at specific times, you can average over all.

Thank you for your reply. I don't quite follow the point above.

Are you implying above that I compute the MSE between the blue line and the 100 black lines? Getting 100 MSE values?

How would you check for time-dependence?

Thank you
 
bradyj7 said:
Are you implying above that I compute the MSE between the blue line and the 100 black lines? Getting 100 MSE values?
No, just a single one (for each point in time): The meanSE.

How would you check for time-dependence?
Can be seen in the resulting graph.
 

Similar threads

Replies
1
Views
3K
Replies
9
Views
3K
  • · Replies 11 ·
Replies
11
Views
2K
  • · Replies 15 ·
Replies
15
Views
3K
  • · Replies 0 ·
Replies
0
Views
4K
  • · Replies 1 ·
Replies
1
Views
3K
  • · Replies 0 ·
Replies
0
Views
3K
  • · Replies 3 ·
Replies
3
Views
4K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 14 ·
Replies
14
Views
3K