Cervical cancer
Files | Size | Format | Created | Updated | License | Source |
---|---|---|---|---|---|---|
1 | 95.9 kB | csv | almost 7 years ago | UCI - Cervical cancer (Risk Factors) Data Set |
Data Files
File | Description | Size | Last modified | Download |
---|---|---|---|---|
cervical-cancer | | 95.9 kB | almost 7 years ago | cervical-cancer |
Data Previews
cervical-cancer
Schema
name | type | format |
---|---|---|
Age | integer | default |
Number of sexual partners | number | default |
First sexual intercourse | number | default |
Num of pregnancies | number | default |
Smokes | number | default |
Smokes (years) | number | default |
Smokes (packs/year) | number | default |
Hormonal Contraceptives | number | default |
Hormonal Contraceptives (years) | number | default |
IUD | number | default |
IUD (years) | number | default |
STDs | number | default |
STDs (number) | number | default |
STDs:condylomatosis | number | default |
STDs:cervical condylomatosis | number | default |
STDs:vaginal condylomatosis | number | default |
STDs:vulvo-perineal condylomatosis | number | default |
STDs:syphilis | number | default |
STDs:pelvic inflammatory disease | number | default |
STDs:genital herpes | number | default |
STDs:molluscum contagiosum | number | default |
STDs:AIDS | number | default |
STDs:HIV | number | default |
STDs:Hepatitis B | number | default |
STDs:HPV | number | default |
STDs: Number of diagnosis | integer | default |
STDs: Time since first diagnosis | string | default |
STDs: Time since last diagnosis | string | default |
Dx:Cancer | integer | default |
Dx:CIN | integer | default |
Dx:HPV | integer | default |
Dx | integer | default |
Hinselmann | integer | default |
Schiller | integer | default |
Citology | integer | default |
Biopsy | integer | default |
This is a dataset about cervical cancer occurrences. Cervical cancer is one of the most frequent cancers among women. The dataset records some factors that might influence cervical cancer.
Data
This dataset was found in the UCI Machine Learning Repository under the name "Cervical cancer (Risk Factors) Data Set".
The dataset was collected at 'Hospital Universitario de Caracas' in Caracas, Venezuela. It comprises demographic information, habits, and historical medical records of 858 patients. Several patients declined to answer some of the questions because of privacy concerns, which is the source of the missing values.
- 858 instances
- 36 attributes
- Missing values: yes
The output data is located in the data directory:
data/cervical-cancer.csv
The attributes are the same as in the input data.
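Missing values appear in the output file as empty cells (see Preparation), so a reader has to decide how to represent them. The following is a minimal sketch using only the Python standard library. The helper names `parse_cell`, `load_rows` and `count_missing` are hypothetical, the sample rows are invented for illustration and are not taken from the dataset, and mapping empty cells to `None` is one possible choice among several.

```python
import csv
import io


def parse_cell(value):
    """Map an empty (or absent) cell to None; otherwise try to parse a number."""
    if not value:
        return None
    try:
        return float(value)
    except ValueError:
        # Assumption: keep non-numeric text (e.g. columns typed "string") as-is.
        return value


def load_rows(f):
    """Read CSV rows from a file object as dicts keyed by column name."""
    return [
        {name: parse_cell(value) for name, value in record.items()}
        for record in csv.DictReader(f)
    ]


def count_missing(rows):
    """Count missing (None) values per column."""
    counts = {}
    for row in rows:
        for name, value in row.items():
            counts[name] = counts.get(name, 0) + (value is None)
    return counts


# Invented sample rows using three column names from the schema.
sample = io.StringIO(
    "Age,Smokes,STDs: Time since first diagnosis\n"
    "18,0.0,\n"
    "34,,2.0\n"
)
rows = load_rows(sample)
missing = count_missing(rows)
print(missing)
```

To use this on the real file, pass an open file object instead of the in-memory sample, for example `open("data/cervical-cancer.csv", newline="")`.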
Preparation
To produce the output data, the following is done to the input data:
- missing values marked with "?" are replaced with "" (an empty string)
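The replacement step above can be sketched as follows. This is an illustration only, not the contents of scripts/main.py; the function name `replace_missing` is hypothetical and the sample rows are invented.

```python
import csv
import io


def replace_missing(src, dst, marker="?"):
    """Copy CSV rows from src to dst, turning cells equal to `marker` into empty cells."""
    reader = csv.reader(src)
    writer = csv.writer(dst, lineterminator="\n")
    for row in reader:
        writer.writerow(["" if cell.strip() == marker else cell for cell in row])


# In-memory demonstration with two column names from the schema.
sample = io.StringIO("Age,Smokes (years)\n18,?\n34,1.5\n")
cleaned = io.StringIO()
replace_missing(sample, cleaned)
print(cleaned.getvalue())
```

For real files, open both the input and the output with `newline=""` and pass the file objects to the function; only cells that consist solely of the marker are changed, so a "?" inside a longer value is left alone.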
The Python scripts are located in the scripts directory:
scripts/main.py
License
Licensed under the Public Domain Dedication and License (assuming either no rights or public domain license in source data).