码迷,mamicode.com
首页 > 编程语言 > 详细

python 将数据随机分为训练集和测试集

时间:2015-06-23 19:57:01      阅读:471      评论:0      收藏:0      [点我收藏+]

标签:

# -*- coding: utf-8 -*-
"""
Created on Tue Jun 23 15:24:19 2015

@author: hd
"""

from sklearn import cross_validation

c = []
j=0
filename = r‘C:\Users\hd\Desktop\bookmarks\bookmarks.arff‘ 
out_train = open(r‘C:\Users\hd\Desktop\bookmarks\train.arff‘,‘w‘)
out_test = open(r‘C:\Users\hd\Desktop\bookmarks\test.arff‘,‘w‘)

for line in open(filename):
#    items = line.strip().split()
    c.append(line)
 
c_train,c_test = cross_validation.train_test_split(c,test_size = 0.6)
for i in c_train:
    out_train.write(i)
for i in c_test:
    out_test.write(i)

  

python 将数据随机分为训练集和测试集

标签:

原文地址:http://www.cnblogs.com/huadongw/p/4595949.html

(0)
(0)
   
举报
评论 一句话评论(0
登录后才能评论!
© 2014 mamicode.com 版权所有  联系我们:gaon5@hotmail.com
迷上了代码!