深度学习是一种机器学习形式,能让计算机从经验中学习并从概念层次理解世界。本书涵盖深度学习众多主题,提供线性代数、概率论、信息论、数值计算和机器学习相关概念等数学和概念背景知识,描述工业界从业者使用的深度学习技术,如深度前馈网络、正则化、优化算法、卷积网络、序列建模和实用方法学等,还涵盖自然语言处理、语音识别、计算机视觉、在线推荐系统、生物信息学和视频游戏等应用,然后提供线性因子模型、自动编码器、表示学习、结构化概率模型、蒙特卡洛方法、配分函数、近似推理和深度生成模型等理论研究视角。可供计划在工业界或研究领域发展的本科生或研究生以及想在产品或平台中使用深度学习的软件工程师使用,有网站为读者和教师提供补充材料。
作者简介
伊恩·古德菲洛是 Google的研究科学家。
约书亚·本吉奥是蒙特利尔大学的计算机科学教授。
亚伦·考维尔是蒙特利尔大学的计算机科学助理教授。
An introduction to a broad range of topics in deep learning, covering mathematical and conceptual background, deep learning techniques used in industry, and research perspectives.“Written by three experts in the field, Deep Learning is the only comprehensive book on the subject.” —Elon Musk, cochair of OpenAI; cofounder and CEO of Tesla and SpaceXDeep learning is a form of machine learning that enables computers to learn from experience and understand the world in terms of a hierarchy of concepts. Because the computer gathers knowledge from experience, there is no need for a human computer operator to formally specify all the knowledge that the computer needs. The hierarchy of concepts allows the computer to learn complicated concepts by building them out of simpler ones; a graph of these hierarchies would be many layers deep. This book introduces a broad range of topics in deep learning. The text offers mathematical and conceptual background, covering relevant concepts in linear algebra, probability theory and information theory, numerical computation, and machine learning. It describes deep learning techniques used by practitioners in industry, including deep feedforward networks, regularization, optimization algorithms, convolutional networks, sequence modeling, and practical methodology; and it surveys such applications as natural language processing, speech recognition, computer vision, online recommendation systems, bioinformatics, and videogames. Finally, the book offers research perspectives, covering such theoretical topics as linear factor models, autoencoders, representation learning, structured probabilistic models, Monte Carlo methods, the partition function, approximate inference, and deep generative models. Deep Learning can be used by undergraduate or graduate students planning careers in either industry or research, and by software engineers who want to begin using deep learning in their products or platforms. A website offers supplementary material for both readers and instructors.