Machine Learning - Empirical Error

Click For Summary
The discussion focuses on understanding the summation in the empirical error equation from "Foundations of Machine Learning." The key point is that the notation 1_{h(x) ≠ c(x)} serves as an indicator function, assigning a value of 1 when the hypothesis does not match the concept and 0 otherwise. This allows for calculating the average error rate across the sample without quantifying the magnitude of the error. The participants clarify that this approach effectively captures whether each data point is an error or not. Overall, the indicator function is essential for determining the empirical error rate in machine learning models.
YoshiMoshi
Messages
233
Reaction score
10
Homework Statement
See Below
Relevant Equations
See below
1599949282377.png

I understand everything in this equation except for the summation. I understand it's the average error over the sample. But why do we need the "1"? Moreover wouldn't the error be the absolute value of the hypothesized value minus the concept value? Meaning
| h( x_i ) - c( x_i ) |
because you have to take the difference between the two to get the error? The original statement in the summation is just saying that the two are not equal. How is this an error?

The above snipping came from a book titled Foundations of Machine Learning by M. Mohri, Afshin Rostamizadeh, Ameet Talwalkar. It's for free on semantic scholar, and this is the beginning of chapter 2.

https://www.semanticscholar.org/pap...e9239469aba4bccf3e36d1c27894721e8dbefc44?p2df
 
Physics news on Phys.org
I think the notation ##1_{h(x)\neq c(x)}## means it takes the value 1 if the subscript is true, i.e. ##h(x) \neq c(x)##, and 0 otherwise.

I guess as long as for each data point it's either an error or not, without further quantification, this calculates the average error rate in your sample.
 
Hey thanks, that would make perfect sense.
 
Office_Shredder said:
I think the notation ##1_{h(x)\neq c(x)}## means it takes the value 1 if the subscript is true, i.e. ##h(x) \neq c(x)##, and 0 otherwise.
That's in agreement with what's in the whitepaper. The author calls ##1_\omega## the "indicator function of the event ##\omega##."
 
Question: A clock's minute hand has length 4 and its hour hand has length 3. What is the distance between the tips at the moment when it is increasing most rapidly?(Putnam Exam Question) Answer: Making assumption that both the hands moves at constant angular velocities, the answer is ## \sqrt{7} .## But don't you think this assumption is somewhat doubtful and wrong?

Similar threads

  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 13 ·
Replies
13
Views
3K
Replies
6
Views
4K