Making Sense of Notation Confusion in Statistical Digital Signal Processing

Click For Summary

Discussion Overview

The discussion revolves around the challenges of notation in statistical digital signal processing, particularly regarding the representation of random variables and their distributions. Participants explore the implications of using different notations and the impact on communication within the field.

Discussion Character

  • Debate/contested
  • Conceptual clarification
  • Technical explanation

Main Points Raised

  • One participant expresses frustration with notation confusion, noting that their choice of variable names (e.g., \omega_m) is met with resistance from colleagues who have different conventions.
  • Another participant argues that adhering to established notation is important for effective communication, suggesting that using unconventional symbols can lead to cognitive dissonance.
  • A third participant emphasizes the utility of common symbols as a shared language, acknowledging the difficulty of learning these conventions.
  • Several participants point out that the notation used for random variables and their distributions can lead to confusion, particularly regarding subscripts and their implications for independence and distribution.
  • One participant suggests that clearer notation could involve explicitly stating the independence of random variables and using alternative formats for listing them.
  • There is a mention of the existence of standards in signal processing that could guide notation choices, prompting one participant to consider checking for these standards.

Areas of Agreement / Disagreement

Participants generally agree that notation is important for communication but disagree on the flexibility of using different notations. Some advocate for strict adherence to conventions, while others believe that clarity of explanation should take precedence over notation.

Contextual Notes

Participants note that the choice of notation can vary significantly between fields, leading to potential misunderstandings. The discussion highlights the need for clarity in defining variables and their relationships, especially when transitioning between deterministic and probabilistic frameworks.

tworitdash
Messages
104
Reaction score
25
I started my research in statistical digital signal processing two years ago, so I need to familiarize myself with all the notations people use in probability and statistics. I come from a deterministic science background. I name my variables based on what they mean. A velocity is a v, a position is p and so on.

However, at some point, I had to define a set of variables that are random draws from a distribution. I called the variable \omega_m, and there are M of them. So I wrote something like the following.

\omega_m \sim \mathcal{N}(\mu_{\omega}, \sigma^2_{\omega}) and I explained what the variables mean. When I explain this to my colleagues or write it in a forum, some people are upset/ confused with my notation, making it unnecessarily hard (in my opinion) to convey the message (although it is quite clear in my head). For example, for the situation given above, statisticians comment that I should not use \omega_m itself as a variable name because its upper-case letter \Omega represents a parameter for a probability space. Some people even get confused when a random variable of this kind is represented with a lowercase letter. They believe it should always be written in an upper-case letter. So, one possible way to please a statistician is to write it like the following.

$$ X_m \sim \mathcal{N}(\mu_, \sigma^2) $$

So, not only did I get rid of the alphabet \omega (that made sense to me based on the variable I had in mind earlier), but I also made it uppercase.

When I show it to someone familiar with signal processing, they are upset with uppercase letters because they represent matrices, not variables.

For a clean and concise paper, why does it matter that we always have to respect some notation? Does the notation matter if we explain what we are trying to say?

To defend myself: I have read many papers and seen people using very different notations to explain the same thing in the past. For example, a Laplacian was represented as \Delta, and sometimes it was represented as \nabla^2.

I ask this here because, as a researcher, most of my time goes into explaining people things based on their understanding, which takes a lot of effort and time. It also sometimes makes me feel dumb, but later I realize that the notations and rules make it difficult (not my understanding of things).
 
Physics news on Phys.org
tworitdash said:
For a clean and concise paper, why does it matter that we always have to respect some notation? Does the notation matter if we explain what we are trying to sayn?
In the middle of your rant I found a question!!
The answer is easy: communication is a two way street. The purpose of writing a paper is to facilitate same. Why not write your work using only Cyrillic and Arabic? Part of learning a field is to understand conventions. If the work is truly extraordinary, your audience may put up with the inconvenience. Seldom does that work out.
Yes it does matter. For instance I usually like threespace to be denoted x,y,z or ##r,\theta,\phi## and parameter to be ##\alpha,\beta,\gamma## . Otherwise I get cognitive dissonance.
 
  • Like
Likes   Reactions: berkeman and tworitdash
It can be a pain to learn but it's a common language to use common symbols.

$$\omega^2_o=(2\pi f_o)^2=\dfrac{1}{LC}$$
You can also have a scratchpad or user profile with handy Greek Letters
- to copy & paste from like Δ Ω σ μ τ ω β δ η ϕθ λ π ζ ∞ ° √ Δ ∂ ∫ Ʃ ± ≈ ≠
 
  • Like
Likes   Reactions: tworitdash
You use a subscript (m) for the random variable, but not for the distribution. All your R.V.'s have same distribution?
 
  • Like
  • Skeptical
Likes   Reactions: tworitdash and TonyStewart
mathman said:
You use a subscript (m) for the random variable, but not for the distribution. All your R.V.'s have same distribution?
Yes, they are i.i.d. I didn't mention that in the question.
 
Last edited:
I think one confusing thing about
##
\omega_m = \mathcal{N}\left(\mu_{\omega},\sigma^2_{\omega}\right)
##
is this: the subscript m on the left indicates a sequence of random quantities while the subscript $\omega$ indicates fixed values for the mean and standard deviation [which they apparently are since you said i.i.d].
Saying something like $$\omega_1, \omega_2, \dots, \omega_M$$ are i.i.d. $$\mathcal{N}\left(\mu,\sigma^2\right)$$ makes it clearer. You could also use $$\{\omega\}_{i=1}^M$$ instead of the list format.
In terms of whether to follow notational[sic] conventions from signal processing or from prob/stat: there must be an existing set of standards for this in signal processing? Look for that.
 
  • Like
Likes   Reactions: tworitdash
statdad said:
I think one confusing thing about
##
\omega_m = \mathcal{N}\left(\mu_{\omega},\sigma^2_{\omega}\right)
##
is this: the subscript m on the left indicates a sequence of random quantities while the subscript $\omega$ indicates fixed values for the mean and standard deviation [which they apparently are since you said i.i.d].
Saying something like $$\omega_1, \omega_2, \dots, \omega_M$$ are i.i.d. $$\mathcal{N}\left(\mu,\sigma^2\right)$$ makes it clearer. You could also use $$\{\omega\}_{i=1}^M$$ instead of the list format.
In terms of whether to follow notational[sic] conventions from signal processing or from prob/stat: there must be an existing set of standards for this in signal processing? Look for that.
Thank you! It makes things clear for me. I will definitely check the standards if there is any.
 

Similar threads

  • · Replies 15 ·
Replies
15
Views
4K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 4 ·
Replies
4
Views
2K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 7 ·
Replies
7
Views
4K
  • · Replies 1 ·
Replies
1
Views
2K
  • · Replies 2 ·
Replies
2
Views
7K
  • · Replies 10 ·
Replies
10
Views
4K
Replies
1
Views
7K
  • · Replies 87 ·
3
Replies
87
Views
8K