Why is convolution used to represent LTI output?

Click For Summary

Discussion Overview

The discussion revolves around the use of convolution to represent the output of linear time-invariant (LTI) systems. Participants explore the mathematical and conceptual foundations of convolution, its implications for system behavior, and the relationship between input and output signals in various contexts.

Discussion Character

  • Exploratory
  • Technical explanation
  • Conceptual clarification
  • Debate/contested

Main Points Raised

  • Some participants question why the output of an LTI system depends on both the current input and previous inputs, suggesting that convolution models this relationship through an integral that accounts for all past inputs.
  • Others point out that while convolution is a common representation for LTI systems, there exist systems that do not require convolution, prompting a discussion on the generality of convolution in describing LTI behavior.
  • One participant proposes that the output can be expressed as a sum of contributions from past inputs, leading to a generalized form that transitions from summation to integration.
  • Another viewpoint emphasizes the linearity of the system, arguing that convolution allows for independent handling of different frequency components without cross-modulation.
  • Some participants provide intuitive explanations of convolution, likening it to the cumulative effect of responses to individual impulses, which can be visualized graphically.
  • A participant introduces the idea of local linearity in contributions to the output, suggesting that if the relationship were nonlinear, the output would not remain linear with respect to the input.
  • There is mention of specific forms of LTI systems, including differential equations, and how they relate to convolution through impulse response and linearity principles.

Areas of Agreement / Disagreement

Participants express various viewpoints on the necessity and implications of convolution in LTI systems. There is no consensus on whether convolution is the only or best representation for all LTI systems, and multiple competing views remain regarding the interpretation and application of convolution.

Contextual Notes

Some discussions highlight the limitations of mathematical proofs in providing physical insight, and there are unresolved questions about the relationship between different domains of input and output signals.

Who May Find This Useful

This discussion may be of interest to students and professionals in electrical engineering, signal processing, and systems theory, particularly those seeking to understand the conceptual underpinnings of convolution in LTI systems.

Nikitin
Messages
734
Reaction score
27
Hi. If you have a LTI system with an impulse response function ##h(t)## taking in an input ##x(t)##, why does its output ##y(t)## become

8ed8359fb3bb3943e6b2dd89810d041f.png
bfc7561dba4b77877ded4fef8496bfd6.png


?

I realize ##y(t)## not only depends on the instantaneous input ##x(t)##, but also on the lingering effects of previous inputs ##x(\tau)## with ##\tau < t##. But how does the convolution integral model this?
 
  • Like
Likes   Reactions: Stephen Tashi
Engineering news on Phys.org
OK I found a mathematical proof on wikipedia. It is coherent, but gives zero physical insight (is this the norm in EE?).

http://en.wikipedia.org/wiki/LTI_system_theory#Continuous-time_systems

However, why does the input ##x(u)## and output ##y(t)## have different domains there? Is it because the range of ##y(t)## is directly dependent on every element in the range of ##x(t)##?
 
There are LTI systems that do not need to be represented by a convolution. So I think the best intuitive approach is to ask why the most general LTI system is represented by a convolution and why all apparently simpler LTI systems can be regarded as being implemented by a convolution.

Suppose the rule for a system S(t) is "the output y(t) is 4 times the input x(t)".
The notation for this can be confusing, since S can be regarded as function t, but we are also thinking of it as function of x(t).
So , abusing notation, S(t) = S( x(t) ) = 4 x(t).
This system is linear. S( A g(t) + B r(t)) = A S(g(t)) + B S(r(t)).
The output is shift invariant in the sense that S(t-h) = S(x(t-h)) = S applied to input function evalated at t-h..

A slightly more general system is
S(t) = S( x(t)) ) = 4 x(t) + 3 x(t-1) + 2 x(t+1)
This is also linear and shift invariant.

This suggests a more general form for LTI systems is:
S(t) = \sum_{i=0}^N A_i x(t-h_i) where the A_i and h_i are constants.

If the h_i are all distinct numbers then we can define a function g(h) that maps the value h_i to the coefficient A_i

Rewriting the above summation in terms of g():

S(t) = \sum_{i=0}^N g(h_i) x(t - h_i)

To further generalize, we replace summation by integration

S(t) = \int_H g(h) x(t-h) dh where H is the range of values that h may take.
 
  • Like
Likes   Reactions: Nikitin
Nikitin said:
OK I found a mathematical proof on wikipedia. It is coherent, but gives zero physical insight (is this the norm in EE?).
Mathematical solutions are deliberately generalised to avoid physical insight as that makes mathematics universally applicable.

So, without using mathematical symbols, why convolution? The important thing is that, being linear, there is no cross-modulation of the signals in the system. All the different frequencies in the response are therefore amplitude and phase independent of each other. The transfer function is then simply the amplitude and phase response of the system to any sinusoidal stimulation. That is not true, as you suggest in the time domain, where all the stimulations and responses are summed in all of time.
 
  • Like
Likes   Reactions: Nikitin
I have a more intuitive means of appreciating the power of convolutions.
If you think of the LTI as it responds to a short impulse, you have the impulse function.
If you imagine the input signal as a stream of impulses of various magnitudes, you can imagine the system responding to each impulse.
If you graph the response to the first impulse, then graph the response of the second response under it, and continue making graphs down the page, you'll see that the responses are all the same except for magnitude and a slight delay to account for when each impulse happens.

Now, if you make a "totaled" graph at the bottom by adding the values of each graph (y_total(t) = x1(t) + x2(t) +x3(t)... ) above, you get the cumulative effect of the signal over time.
This is essentially what happens when you perform the integration, but this has always seemed "right" to me as it allows you to better imagine the process behind the math.
 
  • Like
Likes   Reactions: Nikitin
The way I look at it, the contribution to the output that input x() makes is "locally linear". At t -h it adds some constant multiple of x(t-h) to the output at t. So you can think of the net output as a sum of the form g(h)x(t-h) where g(h) is a function that tells you what constant to multiply x(t-h) by. (This probably amounts to the "impulse response" view.) If the contribution of the input at t-h was a nonlinear expression like (x(t-h))^2 then the output wouldn't be a linear function of the input.
 
  • Like
Likes   Reactions: Nikitin
The simplest way to think about convolution is that for a system that responds immediately that is linear, we get y(t)=hx(t). If it doesn't respond immediately, we can try y(t) =h(0).x(t) + h(1).x(t-1) + h(2)x(t-2) + ..., which written in summation notation is y(t) = sum {h(n)x(t-n)} which is the convolution.

An explicitly non-convolutional form for an LTI system is dy/dt = -y + x. Here I chose the minus sign so that the system doesn't explode when the input x is zero. I think you can show using the impulse response view (make x a delta function) and linearity for superposition to get the convolutional form out of it. I'm not sure I got everything there correct, but you can google "Green's function" for the thing that translates between the differential and integral forms for LTI systems. Also, here I've only used 1 variable.

http://en.wikipedia.org/wiki/Green's_function
http://en.wikipedia.org/wiki/Impulse_response

When one goes to nonlinear systems, the most "general" "nonexploding" convolutional TI approximator in some sense is Volterra series. This precise conditions of this theorem is given by Boyd and Chua http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.126.9363. The usual convolution is just the linear term of the Volterra series.
 
Last edited:

Similar threads

  • · Replies 5 ·
Replies
5
Views
5K
  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 4 ·
Replies
4
Views
3K
  • · Replies 2 ·
Replies
2
Views
2K
  • · Replies 4 ·
Replies
4
Views
3K
  • · Replies 1 ·
Replies
1
Views
1K
  • · Replies 2 ·
Replies
2
Views
5K
  • · Replies 3 ·
Replies
3
Views
4K
Replies
2
Views
2K
  • · Replies 4 ·
Replies
4
Views
4K