| ²é¿´: 1237 | »Ø¸´: 3 | ||
xuh1991ľ³æ (ÕýʽдÊÖ)
|
[ÇóÖú]
BPÉñ¾ÍøÂçºÍÌݶÈϽµ·¨µÄÒÉÎÊ£¿ ÒÑÓÐ2È˲ÎÓë
|
| ÔÚBPÉñ¾ÍøÂçÖУ¬Ä¬ÈϵÄѵÁ·º¯Êý¾ÍÊÇÌݶÈϽµ·¨£¬¼´°´ÕÕÌݶÈϽµ·¨À´¸üÐÂȨֵ£¬ÄÇôÕâÑùµÄBPÉñ¾ÍøÂçºÍÌݶÈϽµ·¨µÄÇø±ðÔÚÄÄÀïÄØ£¿ |
» ²ÂÄãϲ»¶
290Çóµ÷¼Á
ÒѾÓÐ7È˻ظ´
Ò»Ö¾Ô¸ÖÐÄÏ»¯Ñ§£¨0703£©×Ü·Ö337Çóµ÷¼Á
ÒѾÓÐ8È˻ظ´
±±¿Æ281ѧ˶²ÄÁÏÇóµ÷¼Á
ÒѾÓÐ5È˻ظ´
Çóµ÷¼ÁÒ»Ö¾Ô¸ÄϾ©º½¿Õº½Ìì´óѧ289·Ö
ÒѾÓÐ3È˻ظ´
AÇøÏß²ÄÁÏѧµ÷¼Á
ÒѾÓÐ5È˻ظ´
²ÄÁÏѧ˶297ÒѹýËÄÁù¼¶Çóµ÷¼ÁÍÆ¼ö
ÒѾÓÐ11È˻ظ´
316Çóµ÷¼Á
ÒѾÓÐ5È˻ظ´
Ò»Ö¾Ô¸Î÷ÄϽ»Í¨ ר˶ ²ÄÁÏ355 ±¾¿ÆË«·Ç Çóµ÷¼Á
ÒѾÓÐ5È˻ظ´
Ò»Ö¾Ô¸Î人Àí¹¤²ÄÁϹ¤³Ìר˶µ÷¼Á
ÒѾÓÐ8È˻ظ´
329Çóµ÷¼Á
ÒѾÓÐ8È˻ظ´
syddesk
ľ³æ (ÕýʽдÊÖ)
- Ó¦Öú: 125 (¸ßÖÐÉú)
- ½ð±Ò: 2181.4
- É¢½ð: 302
- ºì»¨: 17
- Ìû×Ó: 474
- ÔÚÏß: 130Сʱ
- ³æºÅ: 3188437
- ×¢²á: 2014-05-07
- ÐÔ±ð: GG
- רҵ: ¿ØÖÆÀíÂÛÓë·½·¨
¡¾´ð°¸¡¿Ó¦Öú»ØÌû
¸Ðл²ÎÓ룬ӦÖúÖ¸Êý +1
|
Éñ¾ÍøÂçÊÇÒ»ÖÖ½¨Ä£µÄ˼Ï룬¶ÔÊý¾Ý½øÐÐÄâºÏ£¬´ýÇóµÄÊÇÒ»×é×îÓŵIJÎÊý£» ÌݶÈϽµÊÇÓÃÀ´Çó½âÕâ×é²ÎÊýµÄ·½·¨¡£ Á½ÕßÊDz»Ò»ÑùµÄ |
2Â¥2016-09-02 17:56:02
FMStation
ÖÁ×ðľ³æ (ÖªÃû×÷¼Ò)
- Ó¦Öú: 591 (²©Ê¿)
- ¹ó±ö: 0.03
- ½ð±Ò: 18344.7
- ºì»¨: 97
- Ìû×Ó: 8891
- ÔÚÏß: 1489.9Сʱ
- ³æºÅ: 2400059
- ×¢²á: 2013-04-04
- רҵ: ¼ÆËã»úÓ¦Óü¼Êõ
¡¾´ð°¸¡¿Ó¦Öú»ØÌû
¸Ðл²ÎÓ룬ӦÖúÖ¸Êý +1
|
An analogy for understanding gradient descent The basic intuition behind gradient descent can be illustrated by a hypothetical scenario. A person is stuck in the mountains and is trying to get down (i.e. trying to find the minima). There is heavy fog such that visibility is extremely low. Therefore, the path down the mountain is not visible, so he must use local information to find the minima. He can use the method of gradient descent, which involves looking at the steepness of the hill at his current position, then proceeding in the direction with the steepest descent (i.e. downhill). If he was trying to find the top of the mountain (i.e. the maxima), then he would proceed in the direction steepest ascent (i.e. uphill). Using this method, he would eventually find his way down the mountain. However, assume also that the steepness of the hill is not immediately obvious with simple observation, but rather it requires a sophisticated instrument to measure, which the person happens to have at the moment. It takes quite some time to measure the steepness of the hill with the instrument, thus he should minimize his use of the instrument if he wanted to get down the mountain before sunset. The difficulty then is choosing the frequency at which he should measure the steepness of the hill so not to go off track. https://en.wikipedia.org/wiki/Fi ... o_input_weights.png In this analogy, the person represents the backpropagation algorithm, and the path taken down the mountain represents the sequence of parameter settings that the algorithm will explore. The steepness of the hill represents the slope of the error surface at that point. The instrument used to measure steepness is differentiation (the slope of the error surface can be calculated by taking the derivative of the squared error function at that point). The direction he chooses to travel in aligns with the gradient of the error surface at that point. The amount of time he travels before taking another measurement is the learning rate of the algorithm. https://en.wikipedia.org/wiki/Ba ... ng_gradient_descent |
3Â¥2016-09-03 19:51:07
picklas
ľ³æ (ÖøÃûдÊÖ)
- Ó¦Öú: 4 (Ó×¶ùÔ°)
- ½ð±Ò: 6161.9
- É¢½ð: 200
- ºì»¨: 4
- Ìû×Ó: 2140
- ÔÚÏß: 84.9Сʱ
- ³æºÅ: 4104480
- ×¢²á: 2015-09-26
- ÐÔ±ð: GG
- רҵ: Äý¾Û̬ÎïÐÔI:½á¹¹¡¢Á¦Ñ§ºÍ
|
²»ÊÇÒ»¸ö²ã´ÎµÄ·½·¨£¬Ñ§Ï°¹æÔò¿ÉÒÔÊÇÌݶÈϽµµ«Ò²¿ÉÄܲ»ÊÇ£¬¾ßÌåÒªÊÇÎÊÌâ¶ÔÏóÊýѧģÐͶø¶¨ ·¢×ÔСľ³æIOS¿Í»§¶Ë |
4Â¥2016-09-04 04:57:27













»Ø¸´´ËÂ¥