2.24日学习笔记|2.26日学习笔记|2.27|2.28

zoukankan html css js c++ java

2.24日学习笔记|2.26日学习笔记|2.27|2.28
1.ord函数
>>> ord('a') 97 >>> chr(97) 'a'
就是上面这个样子，返回ASCII值，有值就输出对应的字符。

2.GPU不是用来计算的吗？为什么还可以存储数据呢？

https://www.cnblogs.com/hellobb/p/11023873.html

是可以存储数据的，不然进行高速计算中间结果之类的存储在什么地方呢？。。

CPU的内存一般称之为主存（main memory），GPU自己的存储则称为local memory，即GPU的本地存储，有时候也称为video memory。

就像显卡集显和２G独显之类的，这都是它们的本地存储容量.

3.GRU了解

https://zhuanlan.zhihu.com/p/32481747

内部结构像LSTM，但是门和参数都少了，有重置门和更新门，计算能力和时间成本需求较小。

4.

尝试运行这个：
import torch a=torch.tensor([1,2,3]) out=a.data.view(-1).div(0.1).exp() #运行不起来，会自动结束。。
就会直接RESTART: Shell。。。

终于找到了一些：https://zhuanlan.zhihu.com/p/31170525
# 这里是将输出转化为一个多项式分布 output_dist = output.data.view(-1).div(temperature).exp() # 从而可以根据混乱度 temperature 来选择下一个字符 # 混乱度低，则趋向于选择网络预测最大概率的那个字符 # 混乱度高，则趋向于随机选择字符 top_i = torch.multinomial(output_dist, 1)[0]
torch.multinomial 学习

意思是，从每一行中根据多项式分布采样，每行都采nums_samples个样，然后返回这个tensor。
>>> weights = torch.tensor([0, 10, 3, 0], dtype=torch.float) >>> torch.multinomial(weights, 2) tensor([1, 2]) >>> torch.multinomial(weights, 3) Traceback (most recent call last): File "<pyshell#78>", line 1, in <module> torch.multinomial(weights, 3) RuntimeError: invalid argument 2: invalid multinomial distribution (with replacement=False, not enough non-negative category to sample) at c:aw1swindowspytorchatensrc hgeneric/THTensorRandom.cpp:320 >>> torch.multinomial(weights, 4, replacement=True) tensor([1, 1, 1, 1])
经过以上的实验，首先它是根据input中数据值的大小来采样的，然后默认是不采0的，只采正数.下面可以看出它不采负数的。
>>> w=torch.tensor([0, 10, -3, 0], dtype=torch.float) >>> torch.multinomial(w, 2) Traceback (most recent call last): File "<pyshell#81>", line 1, in <module> torch.multinomial(w, 2) RuntimeError: invalid argument 2: invalid multinomial distribution (encountering proba
5.指数加权平均

https://www.jianshu.com/p/41218cb5e099?utm_source=oschina-app

它能够模拟点的趋势，并且将过去给考虑进来，能够平滑曲线；应用到SGD的时候，能够加快收敛速度，减少震荡。

还能将长期趋势显现出来，通常设置β为0.9。

2-26——————————————

3.距离和相似度度量方法

https://blog.csdn.net/pipisorry/article/details/45651315

5.GEO到底哪些表示真实数据呢？

https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi

2-27——————————

1.JS散度：

https://www.cnblogs.com/smuxiaolei/p/7400923.html

https://zxth93.github.io/2017/09/27/KL%E6%95%A3%E5%BA%A6JS%E6%95%A3%E5%BA%A6Wasserstein%E8%B7%9D%E7%A6%BB/index.html

JS是对称的而且值是有界的[0,1]. 当P1和P2完全相同时，JS=0；当不同时为1.

分别用P1和P2来和第三个分布，相当于一个中间分布来计算KL散度，相比于直接KL就会有一个折中？

2.itertools.chain.from_iterable(l)

https://stackoverflow.com/questions/50775024/itertools-chain-from-iterable

展平一个嵌套的列表。仅仅是展平，不去重。
import itertools country_and_countrycodes = [('United States', 'US'), ('China', 'CH')] all_countries = ['United States', 'Mongolia', 'Togo'] print(list(itertools.chain.from_iterable(country_and_countrycodes))) print(list(itertools.chain.from_iterable(all_countries))) #输出： ['United States', 'US', 'China', 'CH'] ['U', 'n', 'i', 't', 'e', 'd', ' ', 'S', 't', 'a', 't', 'e', 's', 'M', 'o', 'n', 'g', 'o', 'l', 'i', 'a', 'T', 'o', 'g', 'o']
从这里就能看出来它的作用了吧，就是解决嵌套一层的列表。往里走一层。

2-28————————————————————————————————

1.python常用函数

https://cloud.tencent.com/developer/article/1140244 字符串函数、集合函数、列表函数和字典函数。

https://www.cnblogs.com/ljhdo/p/10664286.html

1.filter函数，可以代替循环的if判断，很厉害
>>> ret = filter(lambda x: x % 2 == 0, range(10)) >>> ret <filter object at 0x000002191AEF7DD8> >>> list(ret) [0, 2, 4, 6, 8]
第一个参数为需要满足的条件（一个函数），第二个为一个可迭代循环的对象吧，比如list，然后就可以每个元素去判断是否符合function的条件。

2.list的sort和内置的sort

list的sort是直接对list操作修改，不返回任何内容；内置sort是会返回一个新的排好序的list，不在原list上修改。

有一点特殊讲解是，针对按关键字排序的方法：
>>> list = [('d',3),('a',5),('d',1),('c',2),('d',2)] >>> sorted(list,key=lambda x:x[1]) [('d', 1), ('c', 2), ('d', 2), ('d', 3), ('a', 5)]
也就是当按固定键排序的时候，用key关键字，然后lambda匿名函数。

3.print的格式化输出，这个需要学习
print(format % args)
格式就是这个样子，前面是format字符串，然后由%分隔，args为参数；
>>> name='hh' >>> age=20 >>> print('my name is %s, %d years old'%(name,age)) my name is hh, 20 years old
就挺有用的。其中%d表示整数，%s字符串，%发表示浮点数。

还有不使用%号，而对所有输出格式一视同仁的{}peihe.format（变量1，变量2.。。）
name = 'langxm' age = '28' corp = "NetEase" print("hello my name is {} i am {} i worked at {}".format(name, age, corp))
4.非常常见的map函数

将function应用到每个可迭代对象上：
map(func, seq1[, seq2,…])
>>> map(lambda x: x*x , [1, 2, 3, 4, 5]) <map object at 0x000002191AF0ED68> >>> list(map(lambda x: x*x , [1, 2, 3, 4, 5]))#这里居然不能用list，难道只能用 #for循环访问吗？ Traceback (most recent call last): File "<pyshell#19>", line 1, in <module> list(map(lambda x: x*x , [1, 2, 3, 4, 5])) TypeError: 'list' object is not callable >>> m=map(lambda x: x*x , [1, 2, 3, 4, 5])#输出只有一个seq可迭代对象时 >>> for i in m: print(i) 1 4 9 16 25 >>> t=map(lambda x,y:(x**y,x+y),[1,2,3],[1,2,4])#当有两个seqs时，也可以操作。 >>> for i in t: print(i) (1, 2) (4, 4) (81, 7)
2.nlp小项目

https://zhuanlan.zhihu.com/p/51279338

3.关于CEL计算损失时

https://blog.csdn.net/ccbrid/article/details/79844005

如果时出现，Dimension specified as 0 but tensor has no dimensions，就
loss += criterion(output, target_var[c].unsqueeze(0))
就插入一个维度；

如果是不能有 RuntimeError: multi-target not supported (newbie)的错误，那么就squeeze(0)即可，就是维度的调整，因为CEL交叉熵损失要求target是一个一维的向量吧。

4.torch.bmm(input, mat2, out=None) → Tensor

将存储在input和mat2中的多个矩阵进行批次矩阵乘法操作，两个输入中所含的矩阵数目必须是一样的。

下面的代码就很容易能够看出来它的功能。
>>> input = torch.randn(10, 3, 4) >>> mat2 = torch.randn(10, 4, 5) >>> res = torch.bmm(input, mat2) >>> res.size() torch.Size([10, 3, 5])
查看全文

相关阅读:
使用mongoose--写接口
 数据结构上机实验（2）
时间复杂度十道练习题目
 Python网络爬虫实战入门
 区分矩阵的三大关系
 用python检查矩阵的计算
 ab矩阵(实对称矩阵)
Python大数据应用
 数据结构上机实验（1）
0038. Count and Say (E)

原文地址：https://www.cnblogs.com/BlueBlueSea/p/12357911.html

2.24日学习笔记|2.26日学习笔记|2.27|2.28

1.ord函数

2.GPU不是用来计算的吗？为什么还可以存储数据呢？

3.GRU了解

4.

5.指数加权平均

2-26——————————————

3.距离和相似度度量方法

5.GEO到底哪些表示真实数据呢？

2-27——————————

1.JS散度：

2.itertools.chain.from_iterable(l)

2-28————————————————————————————————

1.python常用函数

2.nlp小项目

3.关于CEL计算损失时

4.torch.bmm(input, mat2, out=None) → Tensor

4.`torch.bmm`(input, mat2, out=None) → Tensor