পর্ব ২০ · ডিপ লার্নিং-এ ক্যালকুলাস

নিউরাল নেটওয়ার্ক

Universal approximator হিসেবে গাণিতিক গঠন

শেখার লক্ষ্য

a = \sigma(\mathbf{w}^\top \mathbf{x} + b)

Linear combination + non-linear activation — এটিই basic unit।

একটি hidden layer সহ feedforward network (যথেষ্ট neuron হলে) যেকোনো continuous ফাংশন compact set-এ যেকোনো নির্ভুলতায় approximate করতে পারে।

Depth → হায়ারার্কিক্যাল feature; প্রতিটি লেয়ার পূর্বের উপর গড়ে।

একই accuracy পেতে deep network-এ exponentially কম parameter লাগে।

CNN: স্থানীয় filter + shared weights — image-এর spatial structure।

RNN/LSTM: ক্রম + memory।

Transformer: attention দিয়ে global dependency।