How does Parceval's theorem help with an efficient CQT?

  • Thread starter Thread starter lidnele
  • Start date Start date
  • Tags Tags
    Theorem
Click For Summary

Discussion Overview

The discussion centers around the application of Parseval's theorem in the context of implementing an efficient constant Q transform (CQT) for pitch detection in real-time applications. Participants explore the mathematical underpinnings and computational efficiency of the proposed algorithm derived from a paper by Judith Brown and Miller Puckette.

Discussion Character

  • Exploratory
  • Technical explanation
  • Homework-related

Main Points Raised

  • One participant seeks clarification on how Parseval's theorem contributes to the efficiency of the CQT implementation, particularly in relation to multiplying FFT'd kernels with FFT'd inputs versus time-domain multiplication.
  • Another participant notes that the sum in equation (5) may not be over N-1 terms due to many K* terms being close to zero, referencing figures in the article that illustrate this point.
  • A later reply acknowledges a previous oversight regarding the significance of the near-zero terms in the context of identifying discrete pitches, suggesting that small errors may be acceptable for the application at hand.

Areas of Agreement / Disagreement

Participants do not reach a consensus on the efficiency implications of Parseval's theorem in this context, and the discussion remains open with various interpretations and clarifications being offered.

Contextual Notes

There are limitations regarding the assumptions made about the efficiency of the algorithm and the specific conditions under which Parseval's theorem applies. The discussion also reflects varying levels of understanding of the mathematical concepts involved.

lidnele
Messages
3
Reaction score
0
Hi,

First, apologies if this is the wrong forum or even the wrong website. The site popped up when searching for a math forum to help me implement an equation efficiently. I’m not sure if this is the place for discussing computational efficiency, but since the efficiency is achieved by way of mathematical theorems, I thought it might fit.

I’m trying to implement an efficient version of the constant Q transform in an attempt to determine pitch for a real-time application, but the math part of my brain has been sitting unused for some years and Google doesn’t seem to hold much information.

I’ve stumbled upon an algorithm derived by Judith Brown and Miller Puckette in an (easy to skim) article here:
http://academics.wellesley.edu/Physics/brown/pubs/effalgV92P2698-P2701.pdf

I’ve verified that the algorithm works (although not yet its speed), but I need to understand it better.
The gist of the article is (if I understand correctly, of course) that a kernel (equation 4) can be pre-calculated and then used repeatedly for multiplication with the input signal. That seems reasonable: less processing power required, much more memory needed.

The article seems to imply that using a form of Parceval’s relation (equation 3) is what provides the speed. I can’t understand the efficiency gain in multiplying an FFT’d kernel with an FFT’ed input, as opposed to just performing the multiplication in the time domain (first and second part of equation (5) in the article).

What’s the catch? I guess what I’d like to know is what the use of Parceval’s theorem really is (especially coupled to the CQT).

If this is such an elementary question that you don’t want to reward it with a lengthy answer, pointers to relevant articles would be sufficient.

Also, if the first thought that pops into your head is “that’s not at all efficient. I know a much better way to do this”, please write away.

Thanks in advance.
 
Physics news on Phys.org
When I look at that document,the first page is blank. Anyone else able to see the first page ?
 
When I click on the link in my post, I get the PDF document displayed as it should. I can't see anything different between page one and the others (apart from the information, of course), so I don't know why the first page would appear to be invisible for you.

I use Safari and its built-in PDF reader, btw, although it shouldn't matter.
 
I think the point is that in equation (5) the sum is not really over N-1 terms because most of the K* terms are close to zero, as shown in Fig 2. Fig 3 is presumably some measure of the error introduced by dropping the near-zero terms. If the objective is to identify discrete 12-tone or 24-tone pitches in a scale, errors of a fraction of a tone interval (e.g. less than half) probably don't matter.
 
Of course. It's funny how one's mind works. I've read the article several times, but somehow I've completely missed that (which some might argue is a central point of the article).

It makes a lot more sense now. Thanks.
 

Similar threads

  • · Replies 3 ·
Replies
3
Views
2K
  • · Replies 3 ·
Replies
3
Views
3K
  • · Replies 1 ·
Replies
1
Views
4K
  • · Replies 13 ·
Replies
13
Views
3K
  • · Replies 5 ·
Replies
5
Views
2K
  • · Replies 69 ·
3
Replies
69
Views
17K
  • · Replies 3 ·
Replies
3
Views
3K
  • · Replies 11 ·
Replies
11
Views
6K
  • · Replies 1 ·
Replies
1
Views
3K
  • · Replies 4 ·
Replies
4
Views
2K