Reinforcement Learning - Return Function

  • Thread starter Thread starter tsaitea
  • Start date Start date
  • Tags Tags
    Function
AI Thread Summary
The discussion centers on the use of the index "k" in summation, clarifying its role as a dummy index distinct from "t," which indicates the first term in the summation. The necessity of using a new index like "k" arises because summing over "t" would lead to confusion, as "t" would be replaced and disappear during the expansion of the sum. The explanation emphasizes that "k" is chosen to avoid this issue, ensuring clarity in the summation process.
tsaitea
Messages
19
Reaction score
0
TL;DR Summary
In this post on the Return function is indexed by k?
1576126497380.png

Where did the k come from? I was expecting the index to be t.
 
Technology news on Phys.org
##t## tells you the index of the first term you are summing over, so you can't sum over that. You need to sum over a new index, called a dummy index - they've chosen ##k##. Note that if you expand the sum, you will replace ##k## with 0 in the zeroth term, replace it with 1 in the first term, and so on - ##k## disappears when you expand the sum. If you'd summed over ##t##, ##t## would likewise be replaced and disappear when you expanded sum, which is not what you want.
 
Dear Peeps I have posted a few questions about programing on this sectio of the PF forum. I want to ask you veterans how you folks learn program in assembly and about computer architecture for the x86 family. In addition to finish learning C, I am also reading the book From bits to Gates to C and Beyond. In the book, it uses the mini LC3 assembly language. I also have books on assembly programming and computer architecture. The few famous ones i have are Computer Organization and...
What percentage of programmers have learned to touch type? Have you? Do you think it's important, not just for programming, but for more-than-casual computer users generally? ChatGPT didn't have much on it ("Research indicates that less than 20% of people can touch type fluently, with many relying on the hunt-and-peck method for typing ."). 'Hunt-and-peck method' made me smile. It added, "For programmers, touch typing is a valuable skill that can enhance speed, accuracy, and focus. While...
I had a Microsoft Technical interview this past Friday, the question I was asked was this : How do you find the middle value for a dataset that is too big to fit in RAM? I was not able to figure this out during the interview, but I have been look in this all weekend and I read something online that said it can be done at O(N) using something called the counting sort histogram algorithm ( I did not learn that in my advanced data structures and algorithms class). I have watched some youtube...
Back
Top