Machine Learning - Empirical Error

Click For Summary

Homework Help Overview

The discussion revolves around understanding the concept of empirical error in machine learning, specifically focusing on the summation notation used in error calculation. Participants are examining the implications of the notation and its relation to the definition of error.

Discussion Character

  • Conceptual clarification, Assumption checking

Approaches and Questions Raised

  • The original poster attempts to clarify the role of the summation and the meaning of the "1" in the context of error calculation. They question whether the error should be defined as the absolute difference between hypothesized and actual values. Other participants discuss the interpretation of the indicator function notation and its relevance to calculating average error rates.

Discussion Status

Participants are actively engaging with the notation and its implications. Some have expressed understanding and agreement with the interpretations presented, while others continue to explore the definitions and assumptions underlying the error calculation.

Contextual Notes

The discussion references a specific text, "Foundations of Machine Learning," which may influence the interpretations and assumptions being discussed. There is an emphasis on understanding the notation without resolving the broader implications of empirical error.

YoshiMoshi
Messages
233
Reaction score
10
Homework Statement
See Below
Relevant Equations
See below
1599949282377.png

I understand everything in this equation except for the summation. I understand it's the average error over the sample. But why do we need the "1"? Moreover wouldn't the error be the absolute value of the hypothesized value minus the concept value? Meaning
| h( x_i ) - c( x_i ) |
because you have to take the difference between the two to get the error? The original statement in the summation is just saying that the two are not equal. How is this an error?

The above snipping came from a book titled Foundations of Machine Learning by M. Mohri, Afshin Rostamizadeh, Ameet Talwalkar. It's for free on semantic scholar, and this is the beginning of chapter 2.

https://www.semanticscholar.org/pap...e9239469aba4bccf3e36d1c27894721e8dbefc44?p2df
 
Physics news on Phys.org
I think the notation ##1_{h(x)\neq c(x)}## means it takes the value 1 if the subscript is true, i.e. ##h(x) \neq c(x)##, and 0 otherwise.

I guess as long as for each data point it's either an error or not, without further quantification, this calculates the average error rate in your sample.
 
Hey thanks, that would make perfect sense.
 
Office_Shredder said:
I think the notation ##1_{h(x)\neq c(x)}## means it takes the value 1 if the subscript is true, i.e. ##h(x) \neq c(x)##, and 0 otherwise.
That's in agreement with what's in the whitepaper. The author calls ##1_\omega## the "indicator function of the event ##\omega##."
 

Similar threads

  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 2 ·
Replies
2
Views
414
  • · Replies 1 ·
Replies
1
Views
5K
  • · Replies 13 ·
Replies
13
Views
4K
Replies
6
Views
4K
  • · Replies 7 ·
Replies
7
Views
4K
  • · Replies 7 ·
Replies
7
Views
4K