Summer Semester 2024
Prof. Dr. Nathan Gibson
In-depth: Large Language Models
–Break–
My office (IG 6.552)
5 minutes:
A few slides or screen-sharing is allowed, but make sure you can keep it to 5 minutes!
Please set up a meeting with me if you haven’t already.
Big Data: Data that defies “traditional methods” of processing or analysis because of its large scale.
Machine Learning: A process of using data to train software to recognize or predict patterns in new data
Ground truth: Correctly labeled data used for training and testing
Neural networks use a process that turns nodes on or off based on many different inputs, and then goes back and refines the “weight” of these inputs.
Large language models predict the next word(s) after having been trained on a very large dataset.
Lee, Timothy B. “A Jargon-Free Explanation of How AI Large Language Models Work.” Ars Technica, July 31, 2023. https://arstechnica.com/science/2023/07/a-jargon-free-explanation-of-how-ai-large-language-models-work/.
Advanced Processing & AI: Generate and Transcribe Audio and Video