6天前(2025-05-09)

《Build a Large Language Model (From Scratch)》PDF下载

41
10
6天前
Build a Large Language Model (From Scratch)封面
41
10
语言:
english
作者:
Sebastian Raschka
出版社:
Manning
发布时间:
2024年1月
页数:
370
ISBN:
9781633437166
标签:

内容简介

For readers who know Python. Experience developing machine learning models is useful but not essential.

中文版地址:《从零构建大模型》

Learn how to create, train, and tweak large language models (LLMs) by building one from the ground up!

In Build a Large Language Model (from Scratch), you’ll discover how LLMs work from the inside out. In this insightful book, bestselling author Sebastian Raschka guides you step by step through creating your own LLM, explaining each stage with clear text, diagrams, and examples. You’ll go from the initial design and creation to pretraining on a general corpus, all the way to finetuning for specific tasks.

Build a Large Language Model (from Scratch) teaches you how to:

Plan and code all the parts of an LLM

Prepare a dataset suitable for LLM training

Finetune LLMs for text classification and with your own data

Use human feedback to ensure your LLM follows instructions

Load pretrained weights into an LLM

The large language models (LLMs) that power cutting-edge AI tools like ChatGPT, Bard, and Copilot seem like a miracle, but they’re not magic. This book demystifies LLMs by helping you build your own from scratch. You’ll get a unique and valuable insight into how LLMs work, learn how to evaluate their quality, and pick up concrete techniques to finetune and improve them.

The process you use to train and develop your own small-but-functional model in this book follows the same steps used to deliver huge-scale foundation models like GPT-4. Your small-scale LLM can be developed on an ordinary laptop, and you’ll be able to use it as your own personal assistant.

about the book

Build a Large Language Model (from Scratch) is a one-of-a-kind guide to building your own working LLM. In it, machine learning expert and author Sebastian Raschka reveals how LLMs work under the hood, tearing the lid off the Generative AI black box. The book is filled with practical insights into constructing LLMs, including building a data loading pipeline, assembling their internal building blocks, and finetuning techniques. As you go, you’ll gradually turn your base model into a text classifier tool, and a chatbot that follows your conversational instructions.

about the author

Sebastian Raschka has been working on machine learning and AI for more than a decade. Sebastian joined Lightning AI in 2022, where he now focuses on AI and LLM research, developing open-source software, and creating educational material. Prior to that, Sebastian worked at the University of Wisconsin-Madison as an assistant professor in the Department of Statistics, focusing on deep learning and machine learning research. He has a strong passion for education and is best known for his bestselling books on machine learning using open-source software.

下载

如果上方的下载按钮无法下载,可以使用此处的下载地址手动跳转。

文件网盘logo夸克网盘下载 下载地址: Build a Large Language Model (From Scratch)电子版下载地址提取码:qTXu

本站所有资源均经过人工检查,确保质量。每一个都是互联网上能收集到的质量最好的版本。对于多个版本的书籍,一般只收录最新版本。

本站所有资源均免费,如果您觉得还行,请分享给更多的人。如果您有任何问题,或者想贡献更优质的版本,可以点击下方【建议/报告问题】按钮提交。