hub / github.com/lazyprogrammer/machine_learning_examples / fit

Method fit

rnn_class/wiki.py:31–136 · view source on GitHub ↗

(self, X, learning_rate=1e-5, mu=0.99, epochs=10, show_fig=True, activation=T.nnet.relu, RecurrentUnit=GRU, normalize=True)

Source from the content-addressed store, hash-verified

29	self.V = V
30
31	def fit(self, X, learning_rate=1e-5, mu=0.99, epochs=10, show_fig=True, activation=T.nnet.relu, RecurrentUnit=GRU, normalize=True):
32	D = self.D
33	V = self.V
34	N = len(X)
35
36	We = init_weight(V, D)
37	self.hidden_layers = []
38	Mi = D
39	for Mo in self.hidden_layer_sizes:
40	ru = RecurrentUnit(Mi, Mo, activation)
41	self.hidden_layers.append(ru)
42	Mi = Mo
43
44	Wo = init_weight(Mi, V)
45	bo = np.zeros(V)
46
47	self.We = theano.shared(We)
48	self.Wo = theano.shared(Wo)
49	self.bo = theano.shared(bo)
50	self.params = [self.Wo, self.bo]
51	for ru in self.hidden_layers:
52	self.params += ru.params
53
54	thX = T.ivector('X')
55	thY = T.ivector('Y')
56
57	Z = self.We[thX]
58	for ru in self.hidden_layers:
59	Z = ru.output(Z)
60	py_x = T.nnet.softmax(Z.dot(self.Wo) + self.bo)
61
62	prediction = T.argmax(py_x, axis=1)
63	# let's return py_x too so we can draw a sample instead
64	self.predict_op = theano.function(
65	inputs=[thX],
66	outputs=[py_x, prediction],
67	allow_input_downcast=True,
68	)
69
70	cost = -T.mean(T.log(py_x[T.arange(thY.shape[0]), thY]))
71	grads = T.grad(cost, self.params)
72	dparams = [theano.shared(p.get_value()*0) for p in self.params]
73
74	dWe = theano.shared(self.We.get_value()*0)
75	gWe = T.grad(cost, self.We)
76	dWe_update = mudWe - learning_rategWe
77	We_update = self.We + dWe_update
78	if normalize:
79	We_update /= We_update.norm(2)
80
81	updates = [
82	(p, p + mudp - learning_rateg) for p, dp, g in zip(self.params, dparams, grads)
83	] + [
84	(dp, mudp - learning_rateg) for dp, g in zip(dparams, grads)
85	] + [
86	(self.We, We_update), (dWe, dWe_update)
87	]
88

Callers 8

train_wikipediaFunction · 0.95

__init__Method · 0.45

get_scalerFunction · 0.45

__init__Method · 0.45

Calls 3

init_weightFunction · 0.90

outputMethod · 0.45

gradMethod · 0.45

Tested by

no test coverage detected