Lecture:

HIDA Lecture: Exploring the World of LLMs

Wednesday, 20.03.2024 · 11 am
online

Speaker: Jan Ebert, Software Engineer and Researcher Large-Scale HPC Machine and Deep Learning, Forschungszentrum Jülich

Date: 20.03.2024, 11 am

Title: ChatGPT's Backgrounds: Exploring the World of Large Language Models  

Abstract

The talk will briefly introduce deep learning, which is driving the current AI revolution, and large language models (LLMs) in a historical context. It will then delve into various aspects of LLM training. Topics such as data, the training process and examples of working with and using LLMs will be covered. The presentation will conclude with an introduction to current state-of-the-art LLMs, applications in domains other than text, and future prospects.

 

Register here!

Jan Ebert

Jan Ebert has studied Cognitive Informatics and Intelligent Systems at Bielefeld University. With high interest in deep learning and high-performance computing, he started to work at Jülich Supercomputing Centre as Software Engineer and Researcher Large-Scale HPC Machine and Deep Learning, supporting researchers in various domains to apply artificial intelligence (AI) techniques for their research and co-founding LAION, an open community for open AI projects.

Recently, Jan has focussed on Transformers and large language models, working in projects such as OpenGPT-X and TrustLLM to create intelligent and thrustworthy language models for European languages.

Alternativ-Text

Subscribe newsletter