Dataset information

Row

Raw data’s feature

Feature Description
sudden Sudden Sensorineural Hearing Loss happen
ID A unique identifier for each individual
sex The biological sex
age The age of the individual
hypertension This individual has hypertension
dyslipidemia This individual has dyslipidemia, which is an abnormal amount of lipids in the blood
stroke This individual has stroke
MI/angina This individual has Myocardial infarction
Diabetes This individual has diabetes.
Thyroid This individual has thyroid dysfunction
CRF This individual has Chronic Renal Failure
Height The height of the individual, in centimeters
Weight The weight of the individual, in kilograms
BMI Body Mass Index of the individual
HE_wc Waist circumference, possibly measured at the health examination in unit of centimeters
TOTALchole Total cholesterol levels in the blood
HDL High-Density Lipoprotein cholesterol levels
TG Triglycerides
LDLcal Low-Density Lipoprotein cholesterol
Phospholipid Phospholipid levels in the blood.
Free Fatty Acid Free fatty acid levels in the blood.
Lipoprotein Likely indicates the concentration of a certain lipoprotein subclass.
Lipid_total Total lipid levels in the blood.
Followup_weeks The number of weeks of follow-up after an initial event or diagnosis.
Siegel_criteria Likely a scoring or classification system related to the study, perhaps to grade the severity of hearing loss.
Initial_audio(dB) The level of hearing loss at first diagnosis.
Last_audio(dB) The current level of hearing loss.
AUDIORt An audio test result for the right ear
AUDIOLt An audio test result for the left ear

Row

Cleaned feature used in this report

Feature Description
sudden Sudden Sensorineural Hearing Loss happen
sex The biological sex
age The age of the individual
hypertension This individual have hypertension
dyslipidemia This individual have dyslipidemia, which is an abnormal amount of lipids in the blood.
stroke This individual have stroke.
MI_angina This individual have Myocardial infarction
Diabetes This individual have diabetes.
Thyroid This individual have thyroid dysfunction
CRF This individual have Chronic Renal Failure
Height The height of the individual, in centimeters.
Weight The weight of the individual, in kilograms.
BMI Body Mass Index of the individual.
HDL High-Density Lipoprotein cholesterol levels
TG Triglycerides
LDLcal Low-Density Lipoprotein cholesterol

Multiple Regression

Row

Find Best Subset Regression Evaluation

Row

Multiple Regression Model for TG

Coefficients of the Multiple Regression Model for TG
Estimate Std. Error t value Pr(>|t|)
(Intercept) 121.622 17.627 6.900 0.000
age 0.499 0.139 3.588 0.000
CRF 93.973 31.132 3.019 0.003
Weight 0.890 0.180 4.945 0.000
HDL -1.615 0.154 -10.498 0.000

Ridge Regression

Row

Ridge Regression: using all feature to predict TG

Coefficients of the Ridge Regression Model for TG
Term Coefficient
(Intercept) 138.745
sudden 13.721
sex -8.458
age 0.274
hypertension 5.976
dyslipidemia 22.123
stroke 8.175
MI_angina 5.920
Diabetes 8.316
Thyroid -21.576
CRF 60.607
Height -0.073
Weight 0.316
BMI 1.449
HDL -1.259
LDLcal -0.044

Row

Ridge Regression: using all feature to predict TG

Evaluation Value for Ridge Regression
Metric Value
MSE 3248.609
R2 0.182
Adjusted_R2 0.149

Row

Ridge Regression: Log Tranformation on TG

Evaluation Value for Ridge Regression After Log Transformation
Metric Value
MSE 72.087
R2 0.982
Adjusted_R2 0.981

natural cubic spline

Row

Scatter Plot for TG vs. HDL & Degree of Freedom Test.

Row

Scatter Plot With Natural Cubic Spline

Evaluation Value for Natural Cubic Spline
Metric Value
in sample MSE 3892.0849
in sample R2 0.1586
out of sample MSE 3194.8315
out of sample R2 0.1956

kNN classifiication

Row

kNN classifiication

Row

kNN Table
KNN Classification Results for Different Values of k
k True_Negatives False_Positives False_Negatives True_Positives
3 158 57 35 8
5 163 57 30 8
10 180 60 13 5

Naive Bayes Classification

Row

Naive Bayes Classification


Bayes Table
Performance Metrics
Performance Metrics
Metric Value
Accuracy 0.7209302
Precision 0.7320000
Recall 0.9734043
F1 Score 0.8356164

Classification using Logistic Regression

Row

Classification using Logistic Regression


Logistic Regression Table
Logistic Regression Classification Confusion Matrix
Actual Class
0 1
0 962 316
1 7 8
Logistic Regression Performance Metrics
Performance Metrics
Accuracy Precision Recall F1.Score
0.75 0.53 0.02 0.05