Build a DeepSeek Model (From Scratch)

Out of Stock
SKU: DADAX163343432X
UPC: 9781633434325
Brand: Manning
Regular price: $72.77

Sold by Ergodebooks, an authorized reseller.

Returns accepted within 30 days | support@ergodebooks.com

Shipping Information
  • Free Standard Shipping — United States only
  • Processing Time: 1–3 business days
  • Estimated Delivery: 3–5 business days after dispatch
  • Double-boxed, fully insured & discreetly packaged
  • Tracking number sent via email once dispatched
  • Orders over $250 require signature upon delivery. Taxes calculated at checkout.
Returns & Refund

Returns accepted within 30 days of delivery.

Damaged or Defective Item

Free return shipping + replacement or full refund

Wrong Item Received

Free return shipping + replacement or full refund

Change of Mind

Return shipping at customer's expense · 25% restocking fee applies

All returns require a Return Authorization (RA) number before sending.

To initiate a return, contact us:

support@ergodebooks.com | +1 (281) 738-1050

Get a free eBook (PDF or ePub) from Manning as well as access to the online liveBook format (and its AI assistant that will answer your questions in any language) when you purchase the print book.

When DeepSeek started making waves in January 2025, it sounded too good to be true. How could a generative AI model get such incredible performance with such low training and operation costs? By creatively blending a variety of strategies and innovations like mixture of experts, latent attention, multi-token prediction, model distillation, and efficient parallelization, DeepSeek set a new standard for what's possible in an open LLM.

Now, in this book you can recreate a laptop-scale version of this cutting-edge model yourself! Learn how to build the features that set DeepSeek apart from other top LLMs!

In Build a DeepSeek Model (From Scratch) you will learn how to:
  • Implement DeepSeek's core architectural innovations, including Multi-Head Latent Attention and Mixture-of-Experts layers
  • Build a production-ready training pipeline with multi-token prediction and FP8 quantization for efficiency and speed
  • Maximize hardware utilization with parallelism strategies like DualPipe
  • Apply post-training methods such as supervised fine-tuning and reinforcement learning to unlock reasoning capabilities
  • Compress and distill large models into smaller, deployable versions for real-world use

In Build a DeepSeek Model (From Scratch) you'll build your own DeepSeek clone from the ground up. First, you'll quickly review LLM fundamentals, with an eye to where DeepSeek's innovations address the common problems and limitations of standard models. Then, you'll learn everything you need to create your own DeepSeek-inspired model, including the innovations that put DeepSeek on the map: Multi-Head Latent Attention (MLA), Multi-Token Prediction (MTP), Mixture of Experts (MoE), model distillation, and reasoning.

About the book

Build a DeepSeek Model (From Scratch) uses intuitive visualizations, code walkthroughs, and a problem-solution narrative to transform complex concepts into practical skills. You will start by coding a DeepSeek attention module, progress to building a fully functional MoE layer, and set up a high-efficiency training pipeline. By the end of the book, you will have a fully operational mini-DeepSeek that runs on your laptop, along with the skills to extend and optimize it for your own research or production applications.

About the reader

For intermediate-to-advanced ML engineers, AI researchers, and graduate students who want to go beyond prebuilt models. You'll need to know deep learning and Python programming.

About the author

Dr. Raj Abhijit Dandekar is a computer scientist and co-founder of Vizuara AI Labs, an online education platform that has trained over 50,000 students globally. He holds a PhD from MIT and is the lead instructor of the popular YouTube series Build DeepSeek from Scratch.

Dr. Rajat Dandekar, PhD in mechanical engineering from Purdue University, specializes in applying machine learning to complex physical systems. He co-founded Vizuara AI Labs.

Naman Dwivedi is an AI researcher at Vizuara AI Labs, specializing in turning advanced deep learning concepts into hands-on, practical code.

Dr. Sreedath Pana holds a PhD from MIT and is a co-founder of Vizuara AI Labs. He is an inventor and AI engineer known for creating self-cleaning AI-powered solar technology.

⚠️ WARNING (California Proposition 65):

This product may contain chemicals known to the State of California to cause cancer, birth defects, or other reproductive harm.

For more information, please visit www.P65Warnings.ca.gov.
