์ธ๊ณต์ง€๋Šฅ ์ •๋ฆฌ [๋ณธ๋ก 7] :: ๊นŠ์–ด์ง„ ์ธ๊ณต์‹ ๊ฒฝ๋ง์˜ ๋ฌธ์ œ์ ?
์ปดํ“จํ„ฐ๊ณผํ•™ (CS)/AI 2020. 2. 29. 19:07

๊นŠ์–ด์ง„ ์ธ๊ณต์‹ ๊ฒฝ๋ง์˜ ๋ฌธ์ œ์  Generalization : training์— ์‚ฌ์šฉ๋˜์ง€ ์•Š์€ data์— ๋Œ€ํ•œ ์„ฑ๋Šฅ Data set Training set training์— ์‚ฌ์šฉํ•˜๋Š” data set Validation set ์ฃผ์–ด์ง„ data set ์ค‘ ๋นผ๋†“์•˜๋‹ค๊ฐ€ ์„ฑ๋Šฅ์„ ๊ฒ€์ฆํ•  ๋•Œ ์‚ฌ์šฉํ•˜๋Š” data set Test set ์ฃผ์–ด์ง€์ง€ ์•Š์•˜๋˜ ์ ‘ํ•œ ์  ์—†๋Š” data set ํ•™์Šต์ด training set์œผ๋กœ ์ง„ํ–‰๋˜๊ธฐ ๋•Œ๋ฌธ์—, ํ•™์Šต์„ ๋ฐ˜๋ณตํ•  ์ˆ˜๋ก training set์— ๋Œ€ํ•œ ์ •ํ™•๋„๋Š” ๋†’์•„์ง€๊ณ  ์˜ค๋ฅ˜์œจ์€ ๋‚ฎ์•„์ง„๋‹ค. ํ•˜์ง€๋งŒ validation set์— ๋Œ€ํ•œ ์˜ค๋ฅ˜์œจ์€ ๋‚ฎ์•„์ง€๋‹ค๊ฐ€ ๋†’์•„์ง€๋Š” ํ˜„์ƒ์„ ๋„๋Š”๋ฐ, ์ด๋Š” ํ•™์Šต์ด ๋„ˆ๋ฌด training set์—๋งŒ ์ ํ•ฉํ•˜๊ฒŒ ๋˜์—ˆ๊ธฐ ๋•Œ๋ฌธ์ด๋‹ค. ์ด๋ฅผ training set์— overfitting(..

์ธ๊ณต์ง€๋Šฅ ์ •๋ฆฌ [๋ณธ๋ก 4] :: ๋”ฅ๋Ÿฌ๋‹์˜ ์‹œ์ž‘
์ปดํ“จํ„ฐ๊ณผํ•™ (CS)/AI 2020. 2. 1. 00:50

๋”ฅ๋Ÿฌ๋‹์˜ ์‹œ์ž‘ ์ด๋Ÿฌํ•œ multi-layer ์˜ forward-propagation ๊ณผ์ •์„ ์‹์œผ๋กœ ๋‚˜ํƒ€๋‚ด๋ณด๋ฉด, h1 = f(x11*w11+x12*w21) net = h1*w13+h2*w23 = f(x11*w11+x12*w21)*w13+f(x11*w12+x12*w22)*w23 ์—ฌ๊ธฐ์„œ f ์ฆ‰, activation fuction์ด linearํ•œ function์ด๋ผ๊ณ  ๊ฐ€์ •ํ•ด๋ณด์ž. ๊ทธ๋ ‡๋‹ค๋ฉด f(x) = ax์˜ ํ˜•ํƒœ์ด๋ฏ€๋กœ, net = x11*a(w11*w13+w12*w23)+x12*a(w21*w13+w22*w23) ์œผ๋กœ ๋‚˜ํƒ€๋‚ผ ์ˆ˜ ์žˆ๋‹ค. ๊ทธ๋ฆฌ๊ณ  ์ด๋Ÿฌํ•œ net์€ ๊ฐ€์žฅ ์ฒ˜์Œ์— ์ฃผ์–ด์ง„ input layer์—๋‹ค๊ฐ€ ์ƒ์ˆ˜๋ฅผ ๊ณฑํ•œ ๊ผด์ด๋ฏ€๋กœ one-layer๋กœ ๋‚˜ํƒ€๋‚ผ ์ˆ˜ ์žˆ๋‹ค. ์ฆ‰, ์—ฌ๋Ÿฌ ๊ฐœ์˜ layer๋ฅผ ๊ฑฐ์ณค์Œ์—๋„ ์‰ฌ์šด ๋ฌธ์ œ๋กœ ..