Classifiers, threshold, and ROC curve

  • Thread starter: fog37
  • Tags: Threshold
AI Thread Summary
A classifier is a machine learning model that categorizes data into two or more classes. Classifiers can be probabilistic, providing a probability score compared against a threshold (commonly 0.5) for decision-making, or deterministic, which do not necessarily rely on a threshold. The discussion highlights the ability to plot the ROC curve for binary classifiers, which is influenced by true positive rate (TPR) and false positive rate (FPR) across various thresholds. However, there is debate over whether all deterministic classifiers utilize a threshold for decision-making and whether ROC curves can be plotted for any classifier type. The term "deterministic classifier" is not widely recognized, and the diversity of classification algorithms complicates generalizations. Additionally, some algorithms like K-nearest neighbors (KNN) and support vector machines (SVMs) can function as both classifiers and predictors, further adding to the complexity of classification discussions in machine learning.
fog37
TL;DR Summary
Classifiers, threshold, and ROC curve
Hello,

A classifier is an ML model that can classify between 2 or more classes. Some classifiers are called probabilistic in the sense that they output a probability score that is then compared against a threshold value (usually 0.5) to make the class decision. Other classifiers are not probabilistic... I guess they are called deterministic. We can always plot the ROC curve for a binary classifier. The ROC curve depends on the TPR, the FPR and the various threshold values explored. The TPR and FPR vary for different threshold values...
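To make the threshold sweep concrete, here is a minimal sketch of how the ROC points could be computed by hand. The labels and scores are hypothetical toy data, and `roc_points` is a name made up for this illustration, not a library function. It assumes the classifier produces a numeric score and that a sample is predicted positive when its score is at or above the threshold.

```python
def roc_points(y_true, scores):
    """Return (threshold, FPR, TPR) tuples, one per distinct score used as threshold."""
    positives = sum(y_true)
    negatives = len(y_true) - positives
    points = []
    # Sweep thresholds from high to low; predict positive when score >= threshold.
    for thr in sorted(set(scores), reverse=True):
        tp = sum(1 for y, s in zip(y_true, scores) if s >= thr and y == 1)
        fp = sum(1 for y, s in zip(y_true, scores) if s >= thr and y == 0)
        points.append((thr, fp / negatives, tp / positives))
    return points


# Hypothetical toy data: true labels and classifier scores.
y_true = [0, 0, 1, 1, 0, 1]
scores = [0.1, 0.4, 0.35, 0.8, 0.7, 0.9]

for thr, fpr, tpr in roc_points(y_true, scores):
    print(f"threshold={thr:.2f}  FPR={fpr:.2f}  TPR={tpr:.2f}")
```

Each threshold gives one (FPR, TPR) pair, and joining those pairs traces the ROC curve. Note that the sweep needs a score to threshold, which is exactly what the question about non-probabilistic classifiers turns on.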

Do all deterministic classifiers make their decision also based on some set threshold? If so, does it mean that we can plot the ROC curve for any classifier, probabilistic or not?

Thank you!
 
I'm not aware of any probabilistic classifier. Usually you just compare the predicted value/class with the actual known value/class of the elements in the testing set, all, like you said, given a threshold, so that, e.g., a threshold of 0.6 will give us a particular confusion matrix. Can you give us examples of probabilistic classifiers?
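As a small illustration of the point about a threshold fixing a confusion matrix, the sketch below counts the four cells at a threshold of 0.6 and, for comparison, at 0.3. The data is hypothetical and `confusion_matrix` here is a hand-written helper for this example, not a library call.

```python
def confusion_matrix(y_true, scores, threshold):
    """Count TP, FP, TN, FN when predicting positive for score >= threshold."""
    counts = {"TP": 0, "FP": 0, "TN": 0, "FN": 0}
    for y, s in zip(y_true, scores):
        predicted = 1 if s >= threshold else 0
        if predicted == 1 and y == 1:
            counts["TP"] += 1
        elif predicted == 1 and y == 0:
            counts["FP"] += 1
        elif predicted == 0 and y == 0:
            counts["TN"] += 1
        else:
            counts["FN"] += 1
    return counts


# Hypothetical test-set labels and scores.
y_true = [0, 0, 1, 1, 0, 1]
scores = [0.1, 0.4, 0.35, 0.8, 0.7, 0.9]

print(confusion_matrix(y_true, scores, 0.6))
print(confusion_matrix(y_true, scores, 0.3))
```

Changing the threshold changes which cell each sample lands in, so each threshold corresponds to its own confusion matrix.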
 
fog37 said:
Some classifiers are called probabilistic in the sense that they output a probability score that is then compared against a threshold value (usually 0.5) to make the class decision.
No, that is not what a probabilistic classifier does: https://en.wikipedia.org/wiki/Probabilistic_classification

fog37 said:
Do all deterministic classifiers make their decision also based on some set threshold?
No: first of all the term 'deterministic classifier' is not generally recognised, and secondly you should revise your understanding of this material and consider whether your question makes sense given the diversity of classification algorithms.

fog37 said:
If so, does it mean that we can plot the ROC curve for any classifier, probabilistic or not?
Once you have revised this material you should be able to see whether this question is relevant.
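One way to approach the ROC question is to look at what happens when a classifier returns only hard labels and no score. The sketch below uses hypothetical labels and predictions, and `roc_point` is a made-up helper. With nothing to sweep there is one confusion matrix, and therefore one point in ROC space rather than a curve.

```python
def roc_point(y_true, y_pred):
    """Return the single (FPR, TPR) pair for hard 0/1 predictions."""
    tp = sum(1 for y, p in zip(y_true, y_pred) if y == 1 and p == 1)
    fn = sum(1 for y, p in zip(y_true, y_pred) if y == 1 and p == 0)
    fp = sum(1 for y, p in zip(y_true, y_pred) if y == 0 and p == 1)
    tn = sum(1 for y, p in zip(y_true, y_pred) if y == 0 and p == 0)
    return fp / (fp + tn), tp / (tp + fn)


# Hypothetical true labels and hard predictions (no scores available).
y_true = [0, 0, 1, 1, 0, 1]
y_pred = [0, 1, 1, 1, 0, 0]

fpr, tpr = roc_point(y_true, y_pred)
print(f"FPR={fpr:.2f}  TPR={tpr:.2f}")
```

Whether a given algorithm can do better than a single point depends on whether it exposes some continuous score that can be thresholded, which varies from algorithm to algorithm.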
 
Confusingly, KNN is sometimes described as a predictor, sometimes as a classifier.
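A rough sketch of why both descriptions get used, reading "predictor" as "regressor" (that reading is an assumption): the same neighbour search can feed either a majority vote over labels or an average over numeric targets. The one-dimensional data and the function names below are hypothetical and written only for this illustration.

```python
from collections import Counter


def k_nearest(train_x, query, k):
    """Indices of the k training points closest to query (1-D, absolute distance)."""
    return sorted(range(len(train_x)), key=lambda i: abs(train_x[i] - query))[:k]


def knn_classify(train_x, train_labels, query, k=3):
    """Classification: majority vote over the labels of the k nearest neighbours."""
    idx = k_nearest(train_x, query, k)
    return Counter(train_labels[i] for i in idx).most_common(1)[0][0]


def knn_regress(train_x, train_values, query, k=3):
    """Regression: mean of the numeric targets of the k nearest neighbours."""
    idx = k_nearest(train_x, query, k)
    return sum(train_values[i] for i in idx) / k


# Hypothetical training data: one feature, a class label, and a numeric target.
train_x = [1.0, 2.0, 3.0, 6.0, 7.0, 8.0]
labels = ["a", "a", "a", "b", "b", "b"]
values = [1.5, 2.5, 3.5, 6.5, 7.5, 8.5]

print(knn_classify(train_x, labels, 2.5))
print(knn_regress(train_x, values, 2.5))
```

The neighbour lookup is shared; only the final aggregation step decides whether the output is a class or a number.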
 
WWGD said:
Confusingly, Knn is sometimes described as a predictor, some times as a classifier.
Yes, in a field as diverse and dynamic as machine learning, categorising and generalising in the way the OP is trying to do is IMHO a waste of time.
 
pbuk said:
Yes, in a field as diverse and dynamic as machine learning categorisation and making generalisations in the way the OP is trying to do is IMHO a waste of time.
The same goes for SVMs, which are also listed for both classification and regression.
 