You cannot select more than 25 topics
Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
df9e95f0b5 | 13 years ago | |
---|---|---|
.. | ||
anes96 | 13 years ago | |
ccard | 13 years ago | |
committee | 13 years ago | |
copper | 13 years ago | |
cpunish | 13 years ago | |
grunfeld | 13 years ago | |
longley | 13 years ago | |
macrodata | 13 years ago | |
nile | 13 years ago | |
randhie | 13 years ago | |
scotland | 13 years ago | |
spector | 13 years ago | |
stackloss | 13 years ago | |
star98 | 13 years ago | |
strikes | 13 years ago | |
sunspots | 13 years ago | |
COPYING | 13 years ago | |
README.txt | 13 years ago | |
__init__.py | 13 years ago | |
template_data.py | 13 years ago |
README.txt
This README was copied from http://projects.scipy.org/scikits/browser/trunk/learn/scikits/learn/datasets/ ----------------------------------------------------------------------------- Last Change: Tue Jul 17 04:00 PM 2007 J This packages datasets defines a set of packages which contain datasets useful for demo, examples, etc... This can be seen as an equivalent of the R dataset package, but for python. Each subdir is a python package, and should define the function load, returning the corresponding data. For example, to access datasets data1, you should be able to do: >> from datasets.data1 import load >> d = load() # -> d contains the data of the datasets data1 load can do whatever it wants: fetching data from a file (python script, csv file, etc...), from the internet, etc... Some special variables must be defined for each package, containing a python string: - COPYRIGHT: copyright informations - SOURCE: where the data are coming from - DESCHOSRT: short description - DESCLONG: long description - NOTE: some notes on the datasets. For the datasets to be useful in the learn scikits, which is the project which initiated this datasets package, the data returned by load has to be a dict with the following conventions: - 'data': this value should be a record array containing the actual data. - 'label': this value should be a rank 1 array of integers, contains the label index for each sample, that is label[i] should be the label index of data[i]. - 'class': a record array such as class[i] is the class name. In other words, this makes the correspondance label index <> label name.