Impressive Video Data Compression

AI Thread Summary
The content discusses advancements in video streaming technology, particularly focusing on data compression techniques that significantly reduce bandwidth requirements. It highlights the blurred lines between algorithms and AI in this context, emphasizing the potential for both innovative applications and the risk of facilitating deep fakes. The technology bears similarities to methods used for animating old photos, where facial points from a recorded actor are mapped to those in static images for video rendering. Additionally, it touches on traditional video compression methods, such as the use of I-frames and P-frames, which optimize data by storing full frames and only the changes between them. Overall, the discussion underscores the dual-edged nature of these advancements in video technology.
anorlunda
Staff Emeritus
Science Advisor
Homework Helper
Insights Author
Messages
11,326
Reaction score
8,750
https://arxiv.org/pdf/2011.15126.pdf
https://nvlabs.github.io/face-vid2vid/
https://wandb.ai/ayush-thakur/face-...hesis-for-Video-Conferencing--Vmlldzo1MzU4ODc

One thing in this modern world seems to be ubiquitous; the demand for streaming more and more video. The
data compression in these papers, appears to be a significant step forward in reducing the bandwidth required.

I don't know whether to call it an algorithm or an AI. The difference is blurry.

On the dark side, it also appears to enable much simpler production of deep fakes.

 
  • Wow
  • Like
Likes Twigg and Borg
Computer science news on Phys.org
Interesting compression. Thanks for sharing.

This seems very similar to the recent animation of old photos using the old photo and a digitized actor doing the actions to be the guide for the video rendering.

They record an actor and key facial points for each frame and then map the old photo facial points to the actors facial points to render the scene.

I imagine too the artifacts can be ignored as side-effects of the video transmission in the viewers mind.

Standard video does something simpler with i-frames and p-frames where the i-frame is a full frame of the image (like jpg or bmp) and the p-frame is what changed.

https://en.wikipedia.org/wiki/Video_compression_picture_types
 
In my discussions elsewhere, I've noticed a lot of disagreement regarding AI. A question that comes up is, "Is AI hype?" Unfortunately, when this question is asked, the one asking, as far as I can tell, may mean one of three things which can lead to lots of confusion. I'll list them out now for clarity. 1. Can AI do everything a human can do and how close are we to that? 2. Are corporations and governments using the promise of AI to gain more power for themselves? 3. Are AI and transhumans...
Thread 'ChatGPT Examples, Good and Bad'
I've been experimenting with ChatGPT. Some results are good, some very very bad. I think examples can help expose the properties of this AI. Maybe you can post some of your favorite examples and tell us what they reveal about the properties of this AI. (I had problems with copy/paste of text and formatting, so I'm posting my examples as screen shots. That is a promising start. :smile: But then I provided values V=1, R1=1, R2=2, R3=3 and asked for the value of I. At first, it said...
Back
Top