Kyle Hill on How ChatGPT works internally

  • Thread starter Thread starter jedishrfu
  • Start date Start date
  • Tags Tags
    chatgpt
AI Thread Summary
Kyle Hill explores the inner workings of ChatGPT, emphasizing the technical challenges faced by the GPT team in developing Large Language Models (LLMs). He references the GPT-3.5 paper, providing insights into the model's functioning without resorting to sensationalism. Hill clarifies that GPT is not sentient and lacks awareness of its outputs, highlighting the extensive training and statistical analysis involved in generating responses. Additionally, he discusses the temperature setting feature that influences GPT's responses and introduces tools for further exploration of GPT models. The content aims to demystify LLMs while maintaining a focus on their technical aspects rather than popular science narratives.
Messages
15,442
Reaction score
10,140
Kyle Hill gets into the internals of how ChatGPT works:

 
  • Like
  • Informative
Likes Demystifier, nsaspook, Tom.G and 2 others
Computer science news on Phys.org
Not complaining but I thought we weren't allowed to discuss pop sci in the forum
 
Feynstein100 said:
Not complaining but I thought we weren't allowed to discuss pop sci in the forum
There are some PF forums where it may be appropriate, depending on the subject. I haven't watched the video above, but it sounds interesting. Posting videos as sources in the technical Physics and Math forums is almost never a good idea.
 
This guy is a more serious if not silly science reporter and has some really good video content. He gets into some of the conceptual and technical challenges faced by the GPT team to make it work well. He cites the GPT-3.5 paper and provides insight into how things are done.
 
It's a decent video that gently describes the basics of Large Language Models (LLMs) and how they function. It's not in the PopSci vein of "ChatGPT will end life as we know it!!!"
 
  • Like
Likes Demystifier and jedishrfu
As a programmer, this was along the lines of something I wanted to see. I posted an earlier one in another thread where the presenter went through the GPT paper looking for things. But I haven't seen a followup to it.

Kyle is very emphatic when he says GPT is not sentient and that it doesn't know what its generating and then goes to show the extensive training and statistical analysis being used to select the best choice of words which was quite enlightening for me.
 
Here's another interesting video on using ChatGPT:



The presenter talks about using the temperature setting to change GPT responses and shows some related tools you can use to explore GPT models.
 

Similar threads

5
Replies
212
Views
13K
Replies
313
Views
26K
Replies
8
Views
3K
Replies
3
Views
3K
Replies
3
Views
1K
Replies
10
Views
3K
Back
Top