'Programming/Deep Learning' 카테고리의 글 목록 (2 Page)

Programming/Deep Learning

[하둡] 파이썬을 이용한 하둡 어플리케이션 - word count 2017.04.12
[텐서플로우] MNIST 고급 - 예제 2017.04.06
[텐서플로우] MNIST 초급 - 예제 2017.04.06
[딥 러닝] 싱글 뉴런 학습 시키기 - 역전파(back propagation) 구현하기 2017.02.06
[딥 러닝] 싱글 뉴런의 작동원리 Feed-forward 구현하기 2017.02.06

[하둡] 파이썬을 이용한 하둡 어플리케이션 - word count

2017. 4. 12. 19:55

하둡 어플리케이션은 보통 Mapper -> Shuffle -> Reducer 순으로 작업을 진행한다.

파이썬 예제와 함께 각각의 결과물을 확인한다.

1. Mapper

#!/usr/bin/env python import sys for line in sys.stdin: line = line.strip() keys = line.split() for key in keys: value = 1 print( "%s\t%d" % (key, value) )

$ cat wordcount_mapper.py | python ./wordcount_mapper.py > output_mapper.txt
$ cat output_mapper.txt
import	1
sys	1
for	1
line	1
in	1
sys.stdin:	1
line	1
=	1
line.strip()	1
keys	1
=	1
line.split()	1
for	1
key	1
in	1
keys:	1
value	1
=	1
1	1
print("{0}\t{1}".format(key,value))	1

2. Shuffle

$ cat output_mapper.txt | sort > output_sort.txt
$ cat output_sort.txt
=	1
=	1
=	1
1	1
for	1
for	1
import	1
in	1
in	1
key	1
keys:	1
keys	1
line	1
line	1
line.split()	1
line.strip()	1
print("{0}\t{1}".format(key,value))	1
sys	1
sys.stdin:	1
value	1

3. Reducer

#!/usr/bin/env python import sys last_key = None running_total = 0 for input_line in sys.stdin: input_line = input_line.strip() this_key, value = input_line.split("\t", 1) value = int(value) if last_key == this_key: running_total += value else: if last_key: print( "%s\t%d" % (last_key, running_total) ) running_total = value last_key = this_key if last_key == this_key: print( "%s\t%d" % (last_key, running_total) )

$ cat output_sort.txt | python wordcount_reducer.py > output_reducer.txt
$ cat output_reducer.txt 
=	3
1	1
for	2
import	1
in	2
key	1
keys:	1
keys	1
line	2
line.split()	1
line.strip()	1
print("{0}\t{1}".format(key,value))	1
sys	1
sys.stdin:	1
value	1

4. Mapper | Shuffle | Reducer

$ cat wordcount_mapper.py | python wordcount_mapper.py | sort | python wordcount_reducer.py > output.txt
$ cat output.txt
=	3
1	1
for	2
import	1
in	2
key	1
keys:	1
keys	1
line	2
line.split()	1
line.strip()	1
print("{0}\t{1}".format(key,value))	1
sys	1
sys.stdin:	1
value	1

딥 러닝에 대해 독학을 하면서 정리한 걸 적고 있습니다.

전공과 무관하며 전문적인 지식이 아니므로 개인적인 의견과 부족하고 틀린 점이 많습니다.

추가 지식 및 잘못된 점을 지적해주시면 공부하는데 많은 도움이 되겠습니다. 감사합니다^^

- 푸어맨

[Reference]

(Writing Hadoop Applications in Python with Hadoop Streaming) http://www.glennklockwood.com/data-intensive/hadoop/streaming.htm

(하둡 스트리밍을 활용한 word count 예제) http://blog.acronym.co.kr/606

(파이썬 문자열 관련함수) http://agiantmind.tistory.com/31

'Programming > Deep Learning' 카테고리의 다른 글

[pyplot] MNIST 글자 이미지 띄우기 (1)	2017.08.01
MNIST, CIFAR-10, CIFAR-100, STL-10, SVHN ILSVRC2012 task 1 - 인식률 랭킹 (0)	2017.04.13
[텐서플로우] MNIST 고급 - 예제 (0)	2017.04.06
[텐서플로우] MNIST 초급 - 예제 (0)	2017.04.06
[딥 러닝] 싱글 뉴런 학습 시키기 - 역전파(back propagation) 구현하기 (0)	2017.02.06

[텐서플로우] MNIST 고급 - 예제

2017. 4. 6. 18:27

import tensorflow as tf
import input_data

# gradient sets 0.1 instead of 0
def weight_variable(shape):
	initial = tf.truncated_normal(shape, stddev=0.1)
	return tf.Variable(initial)

def bias_variable(shape):
	initial = tf.constant(0.1, shape=shape)
	return tf.Variable(initial)

# Convolution
def conv2d(x, W):
	return tf.nn.conv2d(x, W, strides=[1,1,1,1], padding='SAME')

# Pooling
def max_pool_2x2(x):
	return tf.nn.max_pool(x, ksize=[1,2,2,1], strides=[1,2,2,1], padding='SAME')

# mnist data sets
mnist = input_data.read_data_sets("MNIST_data/", one_hot=True)

# mnist data vectors 784(28x28)
x = tf.placeholder(tf.float32, [None, 784])

# 1 layer
# widow size1,2, input channel, output channel
W_conv1 = weight_variable([5,5,1,32])
b_conv1 = bias_variable([32])

# reshape to 4D tensor
x_image = tf.reshape(x, [-1,28,28,1]) # -1,img_x,img_y,color channel

# ReLu
h_conv1 = tf.nn.relu(conv2d(x_image, W_conv1) + b_conv1)
h_pool1 = max_pool_2x2(h_conv1)

# 2 layer
W_conv2 = weight_variable([5,5,32,64])
b_conv2 = bias_variable([64])
# ReLu
h_conv2 = tf.nn.relu(conv2d(h_pool1, W_conv2) + b_conv2)
h_pool2 = max_pool_2x2(h_conv2)

# Fully-Connected Layer
W_fc1 = weight_variable([7*7*64, 1024])
b_fc1 = bias_variable([1024])

h_pool2_flat = tf.reshape(h_pool2, [-1, 7*7*64])
h_fc1 = tf.nn.relu(tf.matmul(h_pool2_flat, W_fc1) + b_fc1)

# Dropout
keep_prob = tf.placeholder(tf.float32)
h_fc1_drop = tf.nn.dropout(h_fc1, keep_prob)

# last softmax
W_fc2 = weight_variable([1024, 10])
b_fc2 = bias_variable([10])

y_conv=tf.nn.softmax(tf.matmul(h_fc1_drop, W_fc2) + b_fc2)

# loss - valid result
y_ = tf.placeholder(tf.float32, [None, 10])

# loss - cross entropy
cross_entropy = tf.reduce_mean(-tf.reduce_sum(y_ * tf.log(y_conv), reduction_indices=[1]))
train_step = tf.train.AdamOptimizer(1e-4).minimize(cross_entropy)

#evaluate
correct_prediction = tf.equal(tf.argmax(y_conv,1), tf.argmax(y_,1))
accuracy = tf.reduce_mean(tf.cast(correct_prediction, tf.float32))

# init Session
sess = tf.Session()
sess.run(tf.initialize_all_variables())

# do learning 20000 counts
for i in range(20000):
	batch = mnist.train.next_batch(50)
	if i%100 == 0:
		train_accuracy = accuracy.eval(session=sess, feed_dict={x:batch[0], y_:batch[1], keep_prob:1.0})
		print("step %d, training accuracy %g" % (i, train_accuracy))
	train_step.run(session=sess, feed_dict={x: batch[0], y_: batch[1], keep_prob: 0.5})
print("* Finished Learning *")

# test accuracy
print("test accuracy %g" % accuracy.eval(feed_dict={x: mnist.test.images, y_: mnist.test.labels, keep_prob: 1.0}))

딥 러닝에 대해 독학을 하면서 정리한 걸 적고 있습니다.

전공과 무관하며 전문적인 지식이 아니므로 개인적인 의견과 부족하고 틀린 점이 많습니다.

추가 지식 및 잘못된 점을 지적해주시면 공부하는데 많은 도움이 되겠습니다. 감사합니다^^

- 푸어맨

[Reference]

(MNIST 고급) https://tensorflowkorea.gitbooks.io/tensorflow-kr/content/g3doc/tutorials/mnist/pros/

(모두를 위한 머신러닝/딥러닝 강의) http://hunkim.github.io/ml/

'Programming > Deep Learning' 카테고리의 다른 글

MNIST, CIFAR-10, CIFAR-100, STL-10, SVHN ILSVRC2012 task 1 - 인식률 랭킹 (0)	2017.04.13
[하둡] 파이썬을 이용한 하둡 어플리케이션 - word count (0)	2017.04.12
[텐서플로우] MNIST 초급 - 예제 (0)	2017.04.06
[딥 러닝] 싱글 뉴런 학습 시키기 - 역전파(back propagation) 구현하기 (0)	2017.02.06
[딥 러닝] 싱글 뉴런의 작동원리 Feed-forward 구현하기 (0)	2017.02.06

[텐서플로우] MNIST 초급 - 예제

2017. 4. 6. 16:18

import tensorflow as tf
import input_data

# mnist data sets
mnist = input_data.read_data_sets("MNIST_data/", one_hot=True)

# mnist data vectors 784(28x28)
x = tf.placeholder(tf.float32, [None, 784])

# (x * W) + b = [None,10]
W = tf.Variable(tf.zeros([784, 10]))
b = tf.Variable(tf.zeros([10]))

# neural network softmax
y = tf.nn.softmax(tf.matmul(x, W) + b)

# loss - valid result
y_ = tf.placeholder(tf.float32, [None, 10])
# loss - cross_entropy
cross_entropy = tf.reduce_mean(-tf.reduce_sum(y_ * tf.log(y), reduction_indices=[1]))

# train step - learning rate 0.5
train_step = tf.train.GradientDescentOptimizer(0.5).minimize(cross_entropy)

# init variable to start learning
init = tf.initialize_all_variables()

# init Session
sess = tf.Session()
sess.run(init)

# do learning 1000 counts
for i in range(1000):
	batch_xs, batch_ys = mnist.train.next_batch(100) # select random 100 data
	sess.run(train_step, feed_dict={x: batch_xs, y_: batch_ys})
print("* Finished Learning *")

# evaluate
correct_predict = tf.equal(tf.argmax(y,1), tf.argmax(y_,1))
accuracy = tf.reduce_mean(tf.cast(correct_predict, tf.float32))

print(sess.run(accuracy, feed_dict={x: mnist.test.images, y_: mnist.test.labels}))

딥 러닝에 대해 독학을 하면서 정리한 걸 적고 있습니다.

전공과 무관하며 전문적인 지식이 아니므로 개인적인 의견과 부족하고 틀린 점이 많습니다.

추가 지식 및 잘못된 점을 지적해주시면 공부하는데 많은 도움이 되겠습니다. 감사합니다^^

- 푸어맨

[Reference]

(MNIST 초급) https://tensorflowkorea.gitbooks.io/tensorflow-kr/content/g3doc/tutorials/mnist/beginners/

(모두를 위한 머신러닝/딥러닝 강의) http://hunkim.github.io/ml/

'Programming > Deep Learning' 카테고리의 다른 글

MNIST, CIFAR-10, CIFAR-100, STL-10, SVHN ILSVRC2012 task 1 - 인식률 랭킹 (0)	2017.04.13
[하둡] 파이썬을 이용한 하둡 어플리케이션 - word count (0)	2017.04.12
[텐서플로우] MNIST 고급 - 예제 (0)	2017.04.06
[딥 러닝] 싱글 뉴런 학습 시키기 - 역전파(back propagation) 구현하기 (0)	2017.02.06
[딥 러닝] 싱글 뉴런의 작동원리 Feed-forward 구현하기 (0)	2017.02.06

[딥 러닝] 싱글 뉴런 학습 시키기 - 역전파(back propagation) 구현하기

2017. 2. 6. 17:31

[딥 러닝] 싱글 뉴런 학습 시키기 - 역전파(back propagation) 구현하기

1. MFC로 다이얼로그 구성

1) Toolbox

2. 소스 코드

1) Neuron 클래스에 back propagation 함수 추가

#define MAX2(a,b) (a) > (b) ? (a) : (b) class Neuron { public: double w_; // weight of one input double b_; // bias double input_, output_; // saved for back-prop public: Neuron() : w_(2.0), b_(1.0) {} Neuron(const double& w_input, const double& b_input) : w_(w_input), b_(b_input) {} double getAct(const double& x) { // for linear or identity activation functions return x; // for ReLU activation functions /*return MAX2(0.0, x);*/ } double getActGrad(const double& x) { // for linear or identity activation functions return 1.0; // for ReLU activation functions //if (x > 0.0) return 1.0; else return 0.0; } double feedForward(const double& input) { input_ = input; const double sigma = w_ * input + b_; output_ = getAct(sigma); return output_; } void propBackward(const double& target) { const double alpha = 0.1; // learning rate const double grad = (output_ - target) * getActGrad(output_); double dw_ = alpha * grad * input_; // last input_) came from d(wx+b)/dw = x double db_ = alpha * grad * 1.0; // last input_) came from d(wx+b)/db = 1 w_ -= dw_; b_ -= db_; } };

2) Back-Prop 버튼과 target 에디트 박스 값과 연동

static Neuron my_neuron;
void CDeepLearningDlg::OnBnClickedButtonCalculate()
{
	// TODO: Add your control notification handler code here
	my_neuron.w_ = GetEditToDouble(m_edit_weight);
	my_neuron.b_ = GetEditToDouble(m_edit_bias);
	
	double input = GetEditToDouble(m_edit_input_x);
	double output = my_neuron.feedForward(input);	
	SetDoubleToEdit(&m_edit_output_y, output);
}

#define ROUNDING(x, dig)    ( floor((x) * pow(float(10), dig) + 0.5f) / pow(float(10), dig) )
void CDeepLearningDlg::OnBnClickedButtonBackprop()
{
	// TODO: Add your control notification handler code here
	int count = 0;

double target = GetEditToDouble(m_edit_target);
	double output = GetEditToDouble(m_edit_output_y);

while (output != target)
	{
		target = GetEditToDouble(m_edit_target);

my_neuron.propBackward(target);

// set edit box
		double weight = my_neuron.w_;
		double bias = my_neuron.b_;
		SetDoubleToEdit(&m_edit_weight, weight);
		SetDoubleToEdit(&m_edit_bias, bias);

OnBnClickedButtonCalculate();
		
		// edit box threshold value
		output = GetEditToDouble(m_edit_output_y);
		output = ROUNDING(output, 5);
		SetDoubleToEdit(&m_edit_output_y, output);
		// edit box threshold value
		if (GetEditToDouble(m_edit_weight) != weight)
		{
			weight = ROUNDING(weight, 5);
			SetDoubleToEdit(&m_edit_weight, weight);
		}
		if (GetEditToDouble(m_edit_bias) != bias)
		{
			bias = ROUNDING(bias, 5);
			SetDoubleToEdit(&m_edit_bias, bias);
		}

// printf
		count++;
		cout << " - Count " << count << endl;
		cout << "w_" << my_neuron.w_ << " b_" << my_neuron.b_ << " y_" << my_neuron.output_ << endl;
	}
	cout << " * Calculate Count : " << count << " *" << endl;
}

3. 실행 결과

초기값 설정

x : 1, Weight : 2, Bias : 3, y : 5, target : 13

딥 러닝에 대해 독학을 하면서 정리한 걸 적고 있습니다.

전공과 무관하며 전문적인 지식이 아니므로 개인적인 의견과 부족하고 틀린 점이 많습니다.

추가 지식 및 잘못된 점을 지적해주시면 공부하는데 많은 도움이 되겠습니다. 감사합니다^^

- 푸어맨

[Reference]

(MFC에서 콘솔창 띄우기) http://poorman.tistory.com/63

(역전파 구현하기) http://blog.naver.com/atelierjpro/220703090092

'Programming > Deep Learning' 카테고리의 다른 글

MNIST, CIFAR-10, CIFAR-100, STL-10, SVHN ILSVRC2012 task 1 - 인식률 랭킹 (0)	2017.04.13
[하둡] 파이썬을 이용한 하둡 어플리케이션 - word count (0)	2017.04.12
[텐서플로우] MNIST 고급 - 예제 (0)	2017.04.06
[텐서플로우] MNIST 초급 - 예제 (0)	2017.04.06
[딥 러닝] 싱글 뉴런의 작동원리 Feed-forward 구현하기 (0)	2017.02.06

[딥 러닝] 싱글 뉴런의 작동원리 Feed-forward 구현하기

2017. 2. 6. 15:08

[딥 러닝] 싱글 뉴런의 작동원리 (Feed-forward 구현하기)

1. MFC로 다이얼로그 구성

1) Toolbox

2) Editbox의 CString 값을 Double로 Get, Set하는 함수 생성

2. Feed-foward 함수 클래스 생성 및 계산 결과 표시

3. 실행 결과

딥 러닝에 대해 독학을 하면서 정리한 걸 적고 있습니다.

전공과 무관하며 전문적인 지식이 아니므로 개인적인 의견과 부족하고 틀린 점이 많습니다.

추가 지식 및 잘못된 점을 지적해주시면 공부하는데 많은 도움이 되겠습니다. 감사합니다^^

- 푸어맨

[Reference]

(위키백과) https://ko.wikipedia.org/wiki/%EB%94%A5_%EB%9F%AC%EB%8B%9D

(C++로 배우는 딥러닝) http://m.blog.naver.com/atelierjpro/220697890605

(C++로 Feed-forward 구현하기) http://blog.naver.com/atelierjpro/220697902502

(인공 뉴런의 작동원리) http://blog.naver.com/atelierjpro/220697901074

'Programming > Deep Learning' 카테고리의 다른 글

MNIST, CIFAR-10, CIFAR-100, STL-10, SVHN ILSVRC2012 task 1 - 인식률 랭킹 (0)	2017.04.13
[하둡] 파이썬을 이용한 하둡 어플리케이션 - word count (0)	2017.04.12
[텐서플로우] MNIST 고급 - 예제 (0)	2017.04.06
[텐서플로우] MNIST 초급 - 예제 (0)	2017.04.06
[딥 러닝] 싱글 뉴런 학습 시키기 - 역전파(back propagation) 구현하기 (0)	2017.02.06

PREV 1 2 NEXT

poorman