学习笔记 | 深入理解LSTM网络 - 走看看

zoukankan html css js c++ java

学习笔记 | 深入理解LSTM网络
概要

Long Short Term Memory networks，简称作LSTM。

Recurrent Neural Networks

Recurrent Neural Networks，简称作RNN，其结构包含回环，可以用来解决有信息停留的问题。

其基本机构如下图所示：

如上图所示，神经网络的一块，A，获得输入xt，并产生输出ht。该结构允许把信息传递给下一个时刻。

一个RNN可以看成许多个相同神经网络的复制，一个神经网络把信息传递给下一个神经网络。如果我们把这个回环打开，就变成下图所示的结构：

这种结构自然被用于语音识别（speech recognition），语言建模（ language modeling），翻译（translation），图片抓取（image captioning）等问题中。应用范围还在不断扩展中。

长期依赖问题(The Problem of Long-Term Dependencies)

参考
- Christopher Olah's blog post on RNNs and LSTMs.
  
  This is the shortest and most accessible read.
  
  [post] 翻译理解LSTM网络
- Deep Learning Book chapter on RNNs.
  
  This will be a very technical read and is recommended for students very comfortable with advanced mathematical notation and scientific papers.
- Andrej Karpathy's lecture on Recurrent Neural Networks.
  
  This is a fairly long lecture (around an hour) but covers the content quite well as always with Karpathy.
- [post] Anyone Can Learn To Code an LSTM-RNN in Python (Part 1: RNN)
查看全文

相关阅读:
[转] 传统 Ajax 已死，Fetch 永生
 React组件属性部类（propTypes）校验
 [转]webpack进阶构建项目(一)
package.json 字段全解析
 [转]Nodejs基础中间件Connect
[转]passport.js学习笔记
 [转]Travis Ci的最接底气的中文使用教程
 建站笔记1：centos6.5下安装mysql
[软件人生]关于认知，能力的思考——中国城市里的无知现象片段
 一步一步学Spring.NET——1、Spring.NET环境准备

原文地址：https://www.cnblogs.com/casperwin/p/6396044.html

Copyright © 2011-2022 走看看