Breast cancer

•

Files	Size	Format	Created	Updated	License	Source
1	20.2 kB	csv		almost 7 years ago		OpenML - breast-cancer

This is a dataset about breast cancer occurrences. This dataset is taken from OpenML - breast-cancer This breast cancer domain was obtained from the University Medical Centre, Institute of Oncolog...

File	Description	Size	Last modified	Download
breast-cancer		20.2 kB	almost 7 years ago	breast-cancer

Data Previews

breast-cancer

Schema

name	type	format	description
age	string	default	10-19, 20-29, 30-39, 40-49, 50-59, 60-69, 70-79, 80-89, 90-99.
mefalsepause	string	default	lt40, ge40, premeno
tumor-size	string	default	0-4, 5-9, 10-14, 15-19, 20-24, 25-29, 30-34, 35-39, 40-44, 45-49, 50-54, 55-59
inv-falsedes	string	default	0-2, 3-5, 6-8, 9-11, 12-14, 15-17, 18-20, 21-23, 24-26, 27-29, 30-32, 33-35, 36-39
falsede-caps	boolean	default	yes, no
deg-malig	integer	default	1, 2, 3
breast	string	default	left, right
breast-quad	string	default	left-up, left-low, right-up, right-low, central
irradiat	boolean	default	yes, no
class	string	default	no-recurrence-events, recurrence-events

This is a dataset about breast cancer occurrences.

Data

This dataset is taken from OpenML - breast-cancer

This breast cancer domain was obtained from the University Medical Centre, Institute of Oncology, Ljubljana, Yugoslavia. Thanks go to M. Zwitter and M. Soklic for providing the data. Please include this citation if you plan to use this database.

Matjaz Zwitter & Milan Soklic (physicians) Institute of Oncology University Medical Center Ljubljana, Yugoslavia – Donors: Ming Tan and Jeff Schlimmer (Jeffrey.Schlimmer@a.gp.cs.cmu.edu) – Date: 11 July 1988.

286 instances
10 attributes
Missing values: yes

Class Distribution:

no-recurrence-events: 201 instances
recurrence-events: 85 instances

Output data

Output data is located in directory data

data/breast-cancer.csv

Scripts

Scripts for dataset are located in directory scripts

scripts/main.py

Licence

Licensed under the Public Domain Dedication and License (assuming either no rights or public domain license in source data).

Data Files