programming matrix

Notes to myself that might be helpful to others...

Monday, November 14, 2016

Stochastic Gradient Descent: Momentum and Nesterov Momentum

http://www.christianherta.de/lehre/dataScience/machineLearning/neuralNetworks/nesterov-momentum.php

https://blogs.princeton.edu/imabandit/2013/02/05/orf523-advanced-optimization-introduction/

https://blogs.princeton.edu/imabandit/2015/06/30/revisiting-nesterovs-acceleration/

https://arxiv.org/pdf/1405.4980v2.pdf

https://en.wikipedia.org/wiki/Convex_hull

http://mathworld.wolfram.com/ConvexHull.html

https://en.wikipedia.org/wiki/Convex_combination

https://en.wikipedia.org/wiki/Tensor
Posted by Derek Pappas at 11:49 AM
Newer Post Older Post Home
View mobile version

My Blog List

  • (fn Arc Language Blog)
    The stack circuitry of the Intel 8087 floating point chip, reverse-engineered
    6 days ago
  • Geeking with Greg
    My book, Algorithms and Misinformation
    1 year ago
  • Joel on Software
    Welcome, Prashanth!
    6 years ago
  • Engineering @ Facebook's Facebook Notes
    Facebook Syndication Error
    9 years ago
  • Skrentablog
    blekko launches izik: tablet search reimagined
    12 years ago
  • Engineering Rapleaf
  • The Database Column
  • find23.net

Followers

Blog Archive

  • ►  2025 (1)
    • ►  April (1)
  • ►  2024 (2)
    • ►  December (1)
    • ►  August (1)
  • ►  2023 (9)
    • ►  December (1)
    • ►  August (8)
  • ►  2022 (1)
    • ►  March (1)
  • ►  2021 (14)
    • ►  September (2)
    • ►  May (12)
  • ►  2020 (25)
    • ►  November (3)
    • ►  October (1)
    • ►  September (8)
    • ►  May (2)
    • ►  April (2)
    • ►  February (4)
    • ►  January (5)
  • ►  2019 (33)
    • ►  October (7)
    • ►  September (10)
    • ►  August (5)
    • ►  April (4)
    • ►  March (1)
    • ►  February (6)
  • ►  2018 (28)
    • ►  December (2)
    • ►  November (5)
    • ►  May (4)
    • ►  February (1)
    • ►  January (16)
  • ►  2017 (157)
    • ►  December (30)
    • ►  November (10)
    • ►  October (10)
    • ►  September (23)
    • ►  August (29)
    • ►  July (19)
    • ►  June (9)
    • ►  May (12)
    • ►  April (6)
    • ►  February (5)
    • ►  January (4)
  • ▼  2016 (121)
    • ►  December (9)
    • ▼  November (6)
      • Parallelizing HTMLPage Downloaders
      • New deep learning formula for NLP
      • Stochastic Gradient Descent: Momentum and Nesterov...
      • Installing CUDA on Ubuntu 16.04
      • Scala pattern matching
      • Disable ASUS Motherboard's UEFI secure boot
    • ►  October (9)
    • ►  September (3)
    • ►  August (6)
    • ►  July (15)
    • ►  June (4)
    • ►  May (21)
    • ►  April (14)
    • ►  March (34)
  • ►  2015 (3)
    • ►  September (2)
    • ►  July (1)
  • ►  2014 (23)
    • ►  February (5)
    • ►  January (18)
  • ►  2013 (7)
    • ►  September (2)
    • ►  July (1)
    • ►  April (2)
    • ►  February (2)
  • ►  2012 (17)
    • ►  August (2)
    • ►  June (11)
    • ►  March (1)
    • ►  February (3)
  • ►  2011 (2)
    • ►  January (2)
  • ►  2010 (13)
    • ►  July (2)
    • ►  June (2)
    • ►  April (3)
    • ►  March (3)
    • ►  January (3)
  • ►  2009 (17)
    • ►  October (1)
    • ►  August (1)
    • ►  May (1)
    • ►  March (2)
    • ►  February (3)
    • ►  January (9)
  • ►  2008 (9)
    • ►  December (9)

About Me

Derek Pappas
View my complete profile
Simple theme. Powered by Blogger.