
Saving the MNIST dataset to local files

Posted: 2018-09-29 23:49:43


Reference article:

http://www.csuldw.com/2016/02/25/2016-02-25-machine-learning-MNIST-dataset/

import numpy as np
import struct
import matplotlib.pyplot as plt
import os

# Read the whole training-image file into memory
filename = "data_AI/MNIST/train-images.idx3-ubyte"
with open(filename, "rb") as binfile:
    buf = binfile.read()

# Header: four big-endian unsigned 32-bit integers
index = 0
magic, numImages, numRows, numColumns = struct.unpack_from(">IIII", buf, index)
index += struct.calcsize(">IIII")
images = []
for i in range(numImages):
    # Each image is 28 * 28 = 784 unsigned bytes
    imgVal = struct.unpack_from(">784B", buf, index)
    index += struct.calcsize(">784B")
    imgVal = list(imgVal)
    # Binarize: clamp every pixel value above 1 down to 1
    for j in range(len(imgVal)):
        if imgVal[j] > 1:
            imgVal[j] = 1

    images.append(imgVal)
arrX = np.array(images)

# Read the labels
with open("data_AI/MNIST/train-labels.idx1-ubyte", "rb") as binFile:
    buf = binFile.read()
# Header: two big-endian unsigned 32-bit integers
index = 0
magic, numItems = struct.unpack_from(">II", buf, index)
index += struct.calcsize(">II")
labels = []
for x in range(numItems):
    # Each label is a single unsigned byte
    im = struct.unpack_from(">1B", buf, index)
    index += struct.calcsize(">1B")
    labels.append(im[0])
arrY = np.array(labels)
print(np.shape(arrY))

# print(np.shape(arrX))
# The following saves the images to local files
path_trainset = "data_AI/MNIST/imgs_train"
path_testset = "data_AI/MNIST/imgs_test"
if not os.path.exists(path_trainset):
    os.mkdir(path_trainset)
if not os.path.exists(path_testset):
    os.mkdir(path_testset)
for i in range(1):  # only the first image; use range(len(arrX)) to save all of them
    img = np.array(arrX[i])
    print(img)
    img = img.reshape(28, 28)
    # File name is "<index>_<label>.png"
    outfile = str(i) + "_" + str(arrY[i]) + ".png"
    # outfile = str(i) + ".png"
    plt.figure()
    plt.imshow(img, cmap="binary")  # display the image in black and white
    plt.savefig(path_trainset + "/" + outfile)
    plt.close()  # release the figure so figures do not pile up across iterations
    print("save" + str(i))
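
The script above trusts the file headers and hard-codes 784 bytes per image. As a minimal sketch (not part of the original post), the parsing step could also check the IDX magic numbers and take the image size from the header. The function names `parse_idx_images` and `parse_idx_labels` are hypothetical helpers introduced only for illustration; the magic values 2051 and 2049 are the ones the MNIST image and label files carry.

```python
import struct

IMAGE_MAGIC = 2051  # 0x00000803: unsigned-byte data, 3 dimensions
LABEL_MAGIC = 2049  # 0x00000801: unsigned-byte data, 1 dimension


def parse_idx_images(buf):
    """Return (rows, cols, images); each image is a flat list of pixel values 0..255."""
    magic, count, rows, cols = struct.unpack_from(">IIII", buf, 0)
    if magic != IMAGE_MAGIC:
        raise ValueError("not an IDX image file, magic=%d" % magic)
    offset = struct.calcsize(">IIII")
    size = rows * cols  # taken from the header instead of hard-coding 784
    if len(buf) < offset + count * size:
        raise ValueError("image file is truncated")
    images = []
    for _ in range(count):
        # Slicing bytes and calling list() yields plain integers in Python 3
        images.append(list(buf[offset:offset + size]))
        offset += size
    return rows, cols, images


def parse_idx_labels(buf):
    """Return the labels as a list of integers."""
    magic, count = struct.unpack_from(">II", buf, 0)
    if magic != LABEL_MAGIC:
        raise ValueError("not an IDX label file, magic=%d" % magic)
    offset = struct.calcsize(">II")
    if len(buf) < offset + count:
        raise ValueError("label file is truncated")
    return list(buf[offset:offset + count])
```

With the buffers read as in the script, `rows, cols, images = parse_idx_images(buf)` followed by `np.array(images)` would give the same array shape as `arrX`, but without the binarization step, which would still have to be applied separately.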

 


Original article: https://www.cnblogs.com/ncuhwxiong/p/9726936.html
