LLMs in Production
Engineering AI Applications
Échec de l'ajout au panier.
Échec de l'ajout à la liste d'envies.
Échec de la suppression de la liste d’envies.
Échec du suivi du balado
Ne plus suivre le balado a échoué
1 mois d'essai gratuit à Audible Standard
Acheter pour 28,13 $
-
Narrateur(s):
-
Christopher Kendrick
-
Auteur(s):
-
Christopher Brousseau
-
Matt Sharp
À propos de cet audio
Unlock the potential of Generative AI with this Large Language Model production-ready playbook for seamless deployment, optimization, and scaling. This hands-on guide takes you beyond theory, offering expert strategies for integrating LLMs into real-world applications using retrieval-augmented generation (RAG), vector databases, PEFT, LoRA, and scalable inference architectures. Whether you're an ML engineer, data scientist, or MLOps practitioner, you’ll gain the technical know-how to operationalize LLMs efficiently, reduce compute costs, and ensure rock-solid reliability in production.
What You’ll Learn:
- Master LLM Fundamentals – Understand tokenization, transformer architectures, and the evolution linguistics to the creation of foundation models.
- RAG & Vector Databases – Augment model capabilities with real-time retrieval and memory-optimized embeddings.
- Training vs Fine-tuning – Learn how to train your own model as well as cutting edge techniques like Distillation, RLHF, PEFT, LoRA, and QLoRA for cost-effective adaptation.
- Prompt Engineering – Discover the quickly evolving world of prompt engineering and go beyond simple prompt and pray methods and learn how to implement structured outputs, complex workflows, and LLM agents.
- Scaling & Cost Optimization – Deploy LLMs into your favorite cloud of choice, on commodity hardware, Kubernetes clusters, and edge devices.
- Securing AI Workflows – Implement guardrails for hallucination mitigation, adversarial testing, and compliance monitoring.
- MLOps for LLMs – Learn all about LLMOps, automate model lifecycle management, retraining pipelines, and continuous evaluation.
Hands-on Projects Include:
• Training a custom LLM from scratch – Build and optimize an industry-specific model.
• AI-Powered VSCode Extension – Use LLMs to enhance developer productivity with intelligent code completion.
• Deploying on Edge Devices – Run a lightweight LLM on a Raspberry Pi or Jetson Nano for real-world AI applications.
PLEASE NOTE: When you purchase this title, the accompanying PDF will be available in your Audible Library along with the audio.
©2024 Manning Publications (P)2025 Manning PublicationsVous pourriez aussi aimer...
-
Platform Engineering
- A Guide for Technical, Product, and People Leaders
- Auteur(s): Camille Fournier, Ian Nowland
- Narrateur(s): Holly Adams
- Durée: 14 h et 1 min
- Version intégrale
-
Au global1
-
Performance1
-
Histoire1
Until recently, infrastructure was the backbone of organizations' operating software they developed in-house. But now that cloud vendors run the computers, companies can finally bring the benefits of agile custom-centricity to their own developers. Adding product management to infrastructure organizations is now all the rage. But how's that possible when infrastructure is still the operational layer of the company?
Auteur(s): Camille Fournier, Autres
-
Building Applications with AI Agents
- Designing and Implementing Multiagent Systems
- Auteur(s): Michael Albada
- Narrateur(s): Nick Mondelli
- Durée: 12 h et 4 min
- Version intégrale
-
Au global0
-
Performance0
-
Histoire0
Generative AI has revolutionized how organizations tackle problems, accelerating the journey from concept to prototype to solution. As the models become increasingly capable, we have witnessed a new design pattern emerge: AI agents. By combining tools, knowledge, memory, and learning with advanced foundation models, we can now sequence multiple model inferences together to solve ambiguous and difficult problems. From coding agents to research agents to analyst agents and more, we've already seen agents accelerate teams and organizations.
Auteur(s): Michael Albada
-
Designing Machine Learning Systems
- An Iterative Process for Production-Ready Applications
- Auteur(s): Chip Huyen
- Narrateur(s): Kathleen Li
- Durée: 12 h et 55 min
- Version intégrale
-
Au global1
-
Performance1
-
Histoire1
Machine learning systems are both complex and unique. Complex because they consist of many different components and involve many different stakeholders. Unique because they're data dependent, with data varying wildly from one use case to the next. In this book, you'll learn a holistic approach to designing ML systems that are reliable, scalable, maintainable, and adaptive to changing environments and business requirements. Author Chip Huyen, cofounder of Claypot AI, considers each design decision in the context of how it can help your system as a whole achieve its objectives.
Auteur(s): Chip Huyen
-
Building Microservices
- Designing Fine-Grained Systems
- Auteur(s): Sam Newman
- Narrateur(s): Theodore O'Brien
- Durée: 21 h et 12 min
- Version intégrale
-
Au global7
-
Performance4
-
Histoire4
As organizations shift from monolithic applications to smaller, self-contained microservices, distributed systems have become more fine-grained. But developing these new systems brings its own host of problems. This expanded second edition takes a holistic view of topics that you need to consider when building, managing, and scaling microservices architectures. Through clear examples and practical advice, author Sam Newman gives everyone from architects and developers to testers and IT operators a firm grounding in the concepts.
Auteur(s): Sam Newman
-
Fundamentals of Software Architecture (2nd Edition)
- A Modern Engineering Approach
- Auteur(s): Neal Ford, Mark Richards
- Narrateur(s): Perry Daniels
- Durée: 16 h et 55 min
- Version intégrale
-
Au global1
-
Performance1
-
Histoire1
Salary surveys worldwide regularly place software architect in the top ten best jobs, yet no real guide exists to help developers become architects. Until now. This updated edition provides a comprehensive overview of software architecture's many aspects, with five new chapters covering the latest insights from the field. Aspiring and existing architects alike will examine architectural characteristics, architectural patterns, component determination, diagramming architecture, governance, data, generative AI, team topologies, and many other topics.
Auteur(s): Neal Ford, Autres
-
Designing Distributed Systems (2nd Edition)
- Patterns and Paradigms for Scalable, Reliable Systems Using Kubernetes
- Auteur(s): Brendan Burns
- Narrateur(s): Tom Beyer
- Durée: 8 h et 33 min
- Version intégrale
-
Au global1
-
Performance1
-
Histoire1
Author Brendan Burns demonstrates how you can adapt existing software design patterns for designing and building reliable distributed applications. Systems engineers and application developers will learn how these long-established patterns provide a common language and framework for dramatically increasing the quality of your system. This fully updated second edition includes new chapters on AI inference, AI training, and building robust systems for the real world.
Auteur(s): Brendan Burns
-
Platform Engineering
- A Guide for Technical, Product, and People Leaders
- Auteur(s): Camille Fournier, Ian Nowland
- Narrateur(s): Holly Adams
- Durée: 14 h et 1 min
- Version intégrale
-
Au global1
-
Performance1
-
Histoire1
Until recently, infrastructure was the backbone of organizations' operating software they developed in-house. But now that cloud vendors run the computers, companies can finally bring the benefits of agile custom-centricity to their own developers. Adding product management to infrastructure organizations is now all the rage. But how's that possible when infrastructure is still the operational layer of the company?
Auteur(s): Camille Fournier, Autres
-
Building Applications with AI Agents
- Designing and Implementing Multiagent Systems
- Auteur(s): Michael Albada
- Narrateur(s): Nick Mondelli
- Durée: 12 h et 4 min
- Version intégrale
-
Au global0
-
Performance0
-
Histoire0
Generative AI has revolutionized how organizations tackle problems, accelerating the journey from concept to prototype to solution. As the models become increasingly capable, we have witnessed a new design pattern emerge: AI agents. By combining tools, knowledge, memory, and learning with advanced foundation models, we can now sequence multiple model inferences together to solve ambiguous and difficult problems. From coding agents to research agents to analyst agents and more, we've already seen agents accelerate teams and organizations.
Auteur(s): Michael Albada
-
Designing Machine Learning Systems
- An Iterative Process for Production-Ready Applications
- Auteur(s): Chip Huyen
- Narrateur(s): Kathleen Li
- Durée: 12 h et 55 min
- Version intégrale
-
Au global1
-
Performance1
-
Histoire1
Machine learning systems are both complex and unique. Complex because they consist of many different components and involve many different stakeholders. Unique because they're data dependent, with data varying wildly from one use case to the next. In this book, you'll learn a holistic approach to designing ML systems that are reliable, scalable, maintainable, and adaptive to changing environments and business requirements. Author Chip Huyen, cofounder of Claypot AI, considers each design decision in the context of how it can help your system as a whole achieve its objectives.
Auteur(s): Chip Huyen
-
Building Microservices
- Designing Fine-Grained Systems
- Auteur(s): Sam Newman
- Narrateur(s): Theodore O'Brien
- Durée: 21 h et 12 min
- Version intégrale
-
Au global7
-
Performance4
-
Histoire4
As organizations shift from monolithic applications to smaller, self-contained microservices, distributed systems have become more fine-grained. But developing these new systems brings its own host of problems. This expanded second edition takes a holistic view of topics that you need to consider when building, managing, and scaling microservices architectures. Through clear examples and practical advice, author Sam Newman gives everyone from architects and developers to testers and IT operators a firm grounding in the concepts.
Auteur(s): Sam Newman
-
Fundamentals of Software Architecture (2nd Edition)
- A Modern Engineering Approach
- Auteur(s): Neal Ford, Mark Richards
- Narrateur(s): Perry Daniels
- Durée: 16 h et 55 min
- Version intégrale
-
Au global1
-
Performance1
-
Histoire1
Salary surveys worldwide regularly place software architect in the top ten best jobs, yet no real guide exists to help developers become architects. Until now. This updated edition provides a comprehensive overview of software architecture's many aspects, with five new chapters covering the latest insights from the field. Aspiring and existing architects alike will examine architectural characteristics, architectural patterns, component determination, diagramming architecture, governance, data, generative AI, team topologies, and many other topics.
Auteur(s): Neal Ford, Autres
-
Designing Distributed Systems (2nd Edition)
- Patterns and Paradigms for Scalable, Reliable Systems Using Kubernetes
- Auteur(s): Brendan Burns
- Narrateur(s): Tom Beyer
- Durée: 8 h et 33 min
- Version intégrale
-
Au global1
-
Performance1
-
Histoire1
Author Brendan Burns demonstrates how you can adapt existing software design patterns for designing and building reliable distributed applications. Systems engineers and application developers will learn how these long-established patterns provide a common language and framework for dramatically increasing the quality of your system. This fully updated second edition includes new chapters on AI inference, AI training, and building robust systems for the real world.
Auteur(s): Brendan Burns