Data Frame Summary

dataset

Dimensions: 49747 x 10
Duplicates: 115
No Variable Stats / Values Freqs (% of Valid) Graph Valid Missing
1 price [numeric] Mean (sd) : 3938.8 (3994.6) min < med < max: 326 < 2401 < 18823 IQR (CV) : 4390 (1) 11270 distinct values 49747 (100.0%) 0 (0.0%)
2 carat [numeric] Mean (sd) : 0.9 (2.1) min < med < max: 0.2 < 0.7 < 50 IQR (CV) : 0.6 (2.3) 423 distinct values 49747 (100.0%) 0 (0.0%)
3 clarity [character] 1. I1 2. IF 3. SI1 4. SI2 5. VS1 6. VS2 7. VVS1 8. VVS2
689(1.4%)
1656(3.3%)
12058(24.2%)
8442(17.0%)
7533(15.1%)
11343(22.8%)
3362(6.8%)
4664(9.4%)
49747 (100.0%) 0 (0.0%)
4 cut [character] 1. Fair 2. Good 3. Ideal 4. Premium 5. Very Geod 6. Very Good
1477(3.0%)
4534(9.1%)
19810(39.8%)
12776(25.7%)
2232(4.5%)
8918(17.9%)
49747 (100.0%) 0 (0.0%)
5 color [character] 1. D 2. E 3. F 4. G 5. H 6. I 7. J
6226(12.5%)
9022(18.1%)
8786(17.7%)
10441(21.0%)
7666(15.4%)
5009(10.1%)
2597(5.2%)
49747 (100.0%) 0 (0.0%)
6 depth [numeric] Mean (sd) : 61.7 (1.4) min < med < max: 43 < 61.8 < 79 IQR (CV) : 1.5 (0) 179 distinct values 49277 (99.1%) 470 (0.9%)
7 table [numeric] Mean (sd) : 57.5 (2.2) min < med < max: 43 < 57 < 95 IQR (CV) : 3 (0) 126 distinct values 49359 (99.2%) 388 (0.8%)
8 x [numeric] Mean (sd) : 5.7 (1.1) min < med < max: 0 < 5.7 < 10.2 IQR (CV) : 1.8 (0.2) 549 distinct values 49527 (99.6%) 220 (0.4%)
9 y [numeric] Mean (sd) : 5.7 (1.1) min < med < max: 0 < 5.7 < 31.8 IQR (CV) : 1.8 (0.2) 547 distinct values 49415 (99.3%) 332 (0.7%)
10 z [numeric] Mean (sd) : 3.5 (0.7) min < med < max: 0 < 3.5 < 31.8 IQR (CV) : 1.1 (0.2) 371 distinct values 49322 (99.1%) 425 (0.9%)

Generated by summarytools 0.9.9 (R version 3.6.3)
2021-07-08