deep learning basics

Published by C-B-LIU on 2019-03-21
  1. accuracy: fraction of the images that were correctly classified
  2. overfitting: shown, for example, by the gap between training accuracy and test accuracy
  3. tensors: multidimensional Numpy arrays
  4. tensors are a generalization of matrices to an arbitrary number of dimensions
  5. scalar: 0D tensor, 0 axes, ndim == 0, rank == 0, a single number
  6. vector: 1D tensor, 1 axis, ndim == 1; its single axis can have many dimensions (entries)
  7. matrix: 2D tensor, 2 axes, ndim == 2; the first axis is rows, the second is columns
  8. 3D tensor: ndim == 3
  9. key attributes
    1. number of axes: rank, ndim
    2. shape: how many dimensions along each axis
    3. data type: string tensors don't exist in Numpy
  10. tensor slicing: select along each axis independently; a bare : selects the entire axis (see the sketch below)
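A minimal NumPy sketch of these attributes and of slicing (the array below is just zeros, purely for illustration):

import numpy as np

x = np.zeros((60000, 28, 28), dtype='float32')   # e.g. a stack of 28x28 images
print(x.ndim)     # 3 -> number of axes (rank)
print(x.shape)    # (60000, 28, 28) -> dimensions along each axis
print(x.dtype)    # float32 -> data type of the elements

batch = x[10:20]                 # slice 10 samples along the first axis
patch = x[:, 7:21, 7:21]         # a bare : keeps the whole first axis
print(batch.shape, patch.shape)  # (10, 28, 28) (60000, 14, 14)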
  11. data tensors: (shape examples sketched below)
    1. feature data: 2D (samples, features)
    2. time series: 3D (samples, timesteps, features)
    3. image: 4D (samples, height, width, channels) (tensorflow backend)
    4. video: 5D (samples, frames, height, width, channels)
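Shape-only illustrations of these four cases (every size below is an arbitrary example):

import numpy as np

vector_data = np.zeros((10000, 20))         # (samples, features)
timeseries = np.zeros((250, 390, 3))        # (samples, timesteps, features)
images = np.zeros((128, 256, 256, 3))       # (samples, height, width, channels)
videos = np.zeros((4, 240, 144, 256, 3))    # (samples, frames, height, width, channels)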
  12. tensor operations:
    1. element-wise: applied independently to each entry; fast implementations rely on Basic Linear Algebra Subprograms (BLAS); see the NumPy sketch after this list
    2. broadcasting: broadcast small tensor to match the shape of the larger tensor
    3. tensor dot: dimension change: (a, b, c, d) . (d, e) -> (a, b, c, e)
    4. tensor reshape: the number of elements does not change
    5. all tensor operations are geometric transformations of the input data
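A small NumPy sketch of the operations above (shapes chosen arbitrarily):

import numpy as np

x = np.random.random((64, 32))
y = np.random.random((64, 32))
b = np.random.random((32,))
w = np.random.random((32, 10))

z = np.maximum(x + y, 0.)      # element-wise addition followed by relu
z = x + b                      # broadcasting: b is repeated along the first axis of x
out = np.dot(x, w)             # tensor dot: (64, 32) . (32, 10) -> (64, 10)
flat = x.reshape((64 * 32,))   # reshape: same 2048 elements, different shape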
  13. tensor operations are differentiable
  14. the gradient is the derivative of a tensor operation; it takes tensors as inputs
  15. gradient(f)(W0) is the gradient of f(W) = loss_value at the point W0
  16. optimization follows the opposite direction: W1 = W0 - step * gradient(f)(W0)
  17. gradient optimization: O(N)
  18. mini-batch stochastic gradient descent: optimize on a random small batch of samples at a time (see the NumPy sketch below)
  19. true SGD: randomly pick a single sample at a time and perform the update
  20. batch SGD: optimize on the entire sample set
  21. loss surfaces: the loss value viewed as a function over the space of model parameters
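A toy NumPy sketch of the update W1 = W0 - step * gradient(f)(W0) applied per mini-batch to a linear-regression loss (the data, batch size, and learning rate are all invented for illustration):

import numpy as np

np.random.seed(0)
X = np.random.random((1000, 3))                # 1000 samples, 3 features
true_w = np.array([1.5, -2.0, 0.5])
y = X @ true_w + 0.01 * np.random.randn(1000)  # noisy targets

w = np.zeros(3)                                # W0
step = 0.1                                     # learning rate
for epoch in range(20):                        # one epoch = one pass over the data
    order = np.random.permutation(len(X))
    for start in range(0, len(X), 32):         # mini-batches of 32 samples
        batch = order[start:start + 32]
        pred = X[batch] @ w
        grad = 2 * X[batch].T @ (pred - y[batch]) / len(batch)  # d(MSE)/dw on this batch
        w = w - step * grad                    # W1 = W0 - step * gradient
print(w)                                       # approaches [1.5, -2.0, 0.5]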
  22. momentum:
# pseudocode sketch of SGD with momentum; get_current_parameters and
# update_parameter stand for framework-specific calls
past_velocity = 0.
momentum = 0.1         # momentum factor
learning_rate = 0.01   # step size (illustrative value)
loss = 1.0             # placeholder so the loop condition is defined
while loss > 0.01:
    weight, loss, gradient = get_current_parameters()
    # the velocity accumulates past gradients, damping oscillations in the updates
    velocity = past_velocity * momentum + learning_rate * gradient
    weight = weight + momentum * velocity - learning_rate * gradient
    past_velocity = velocity
    update_parameter(weight)
  23. back-propagation: http://galaxy.agh.edu.pl/~vlsi/AI/backp_t_en/backprop.html
  24. symbolic differentiation: TensorFlow
  25. procedure: (an end-to-end sketch follows this list)
    1. input data
    2. reshape data
    3. construct network
    4. network-compilation
    5. train: each iteration over all the training data is called an epoch
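A hedged end-to-end sketch of this procedure with Keras on MNIST (the 512-unit layer, 5 epochs, and batch size of 128 are illustrative choices, not prescribed by these notes):

from keras import models
from keras import layers
from keras.datasets import mnist
from keras.utils import to_categorical

# 1. input data
(train_images, train_labels), (test_images, test_labels) = mnist.load_data()
# 2. reshape (and rescale) the data
train_images = train_images.reshape((60000, 28 * 28)).astype('float32') / 255
test_images = test_images.reshape((10000, 28 * 28)).astype('float32') / 255
train_labels = to_categorical(train_labels)
test_labels = to_categorical(test_labels)
# 3. construct the network
network = models.Sequential()
network.add(layers.Dense(512, activation='relu', input_shape=(28 * 28,)))
network.add(layers.Dense(10, activation='softmax'))
# 4. network compilation
network.compile(optimizer='rmsprop', loss='categorical_crossentropy', metrics=['accuracy'])
# 5. train: each pass over all the training data is one epoch
network.fit(train_images, train_labels, epochs=5, batch_size=128)
test_loss, test_acc = network.evaluate(test_images, test_labels)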
  26. layers: (instantiation examples in the sketch below)
    1. dense: vector data (samples, features)
    2. recurrent: sequence data (samples, timesteps, features)
    3. image: 2D convolution
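For instance, a typical Keras layer for each data type (the layer sizes and input shapes are arbitrary):

from keras import layers

dense = layers.Dense(32, input_shape=(784,))                # vector data: (samples, features)
recurrent = layers.LSTM(32, input_shape=(100, 64))          # sequences: (samples, timesteps, features)
conv = layers.Conv2D(32, (3, 3), input_shape=(28, 28, 1))   # images: (samples, height, width, channels)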
  27. layer input and output: e.g. 784 inputs, 32 outputs
from keras import models
from keras import layers

# first layer: accepts 784-dimensional input vectors and outputs 32-dimensional vectors
model = models.Sequential()
model.add(layers.Dense(32, input_shape=(784,)))
# second layer: its input size (32) is inferred from the previous layer's output
model.add(layers.Dense(32))
  28. the topology of a network defines a hypothesis space
  29. the network topology defines the series of tensor operations applied to the input
  30. for multi-loss networks, all losses are combined via a function (e.g. averaging) into a single scalar quantity that drives the gradient-descent process (see the sketch below)
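A hedged Keras sketch of a two-output network whose losses are merged into one scalar through loss weights (the layer sizes and the 0.5/0.5 weights are made up):

from keras import layers, models

inputs = layers.Input(shape=(128,))
x = layers.Dense(64, activation='relu')(inputs)
out_a = layers.Dense(1, activation='sigmoid', name='out_a')(x)    # binary head
out_b = layers.Dense(10, activation='softmax', name='out_b')(x)   # 10-class head
model = models.Model(inputs, [out_a, out_b])
# the two losses are combined into a single scalar via loss_weights before gradient descent
model.compile(optimizer='rmsprop',
              loss=['binary_crossentropy', 'categorical_crossentropy'],
              loss_weights=[0.5, 0.5])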
  31. loss function: (example compile calls after this list)
    1. binary crossentropy for two-class classification
    2. categorical crossentropy for many-class classification
    3. mean squared error for regression problems
    4. connectionist temporal classification for sequence-learning problems
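Matching compile calls, sketched on a throwaway one-layer model (CTC is usually implemented as a custom loss, so it is omitted here):

from keras import models, layers

model = models.Sequential([layers.Dense(1, activation='sigmoid', input_shape=(16,))])
model.compile(optimizer='rmsprop', loss='binary_crossentropy', metrics=['accuracy'])   # two-class
# a many-class classifier would end in softmax and use:
#   model.compile(optimizer='rmsprop', loss='categorical_crossentropy', metrics=['accuracy'])
# a regression model would end in a linear output and use:
#   model.compile(optimizer='rmsprop', loss='mse', metrics=['mae'])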
