The file sample12.txt contains prefecture-level age-adjusted mortality rates by major cause of death, separately for males and females, in 2005. The variables are as follows: ALLM and ALLF are age-adjusted mortality from all causes, CANCERM and CANCERF from neoplasms (cancer), CARDIOM and CARDIOF from cardiovascular disease, CEREBROM and CEREBROF from cerebrovascular disease, PNEUMOM and PNEUMOF from pneumonia, ACCIDENTM and ACCIDENTF from accidents, SUICIDEM and SUICIDEF from suicide, SCENILM and SCENILF from senescence, KIDNEYFM and KIDNEYFF from kidney failure, LIVERDM and LIVERDF from liver disease, COPDM and COPDF from chronic obstructive pulmonary disease (COPD), and DIABETESM and DIABETESF from diabetes. The file also contains aggregate measures of each prefecture's socioeconomic status: HHSAVINGS is the average savings per household (unit: 1,000 yen) in 2004, AVETEMP is the annual average temperature in 2005, MYCAR is the average number of privately owned cars per household in 2005, PRODUCTS is total industrial products (unit: million yen) in 2005, POP is the total population (as ordinary household members) in 2005, and PRODPP is the ratio of PRODUCTS to POP. The data come from the website of the Ministry of Health, Labour and Welfare, e-Stat, and "Japanese industry from the viewpoint of statistics" on the website of the Ministry of Economy, Trade and Industry. The file also includes the variables PREF (the prefecture name, in Japanese) and AREA (whether the prefecture belongs to eastern or western Japan, in Japanese). There are several criteria for dividing Japan into east and west; here I used the criterion that extends eastern Japan farthest to the west, so that the westernmost prefectures of eastern Japan are Fukui, Gifu, Aichi and Mie.
Using Wilcoxon's rank sum test, examine whether the age-adjusted mortality from chronic obstructive pulmonary disease (COPDM and COPDF) differs between eastern and western Japan (as indicated by AREA). Set the significance level (alpha error) to 0.05.
Please write your student ID number and name, and fill in the boxes with the appropriate text.
(The code is shown below.)
x <- read.delim("http://phi.med.gunma-u.ac.jp/medstat/sample12.txt")
layout(1:2) # split the graphics window into upper and lower panels
stripchart(COPDM ~ AREA, [A]=x, vert=TRUE, [B]=…)
stripchart(COPDF ~ AREA, [A]=x, vert=TRUE, [B]=…)
[C](COPDM ~ AREA, [A]=x, exact=FALSE) # Wilcoxon's rank sum test for COPDM by AREA
[C](COPDF ~ AREA, [A]=x, exact=FALSE) # Wilcoxon's rank sum test for COPDF by AREA
According to the graph, one prefecture has exceptionally high COPD mortality in both males and females (in fact, it is Okinawa prefecture). Because a rank-based test is far less sensitive to such an outlier than a test based on means, Wilcoxon's rank sum test is more suitable than the t-test for comparing COPD mortality between eastern and western Japan.
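The effect of a single extreme value on the two tests can be illustrated with a minimal sketch. The numbers below are made up for illustration and are not taken from sample12.txt; the vector names `east`, `west` and `west_outlier` are hypothetical. Replacing the largest value with a far more extreme one leaves the ranks unchanged, so the rank sum test gives the same result, while the t-test result shifts.

```r
# Synthetic, made-up values for illustration only (not taken from sample12.txt).
east <- c(10.1, 11.3, 12.2, 13.4, 14.6, 15.8)
west <- c(12.9, 14.1, 15.2, 16.5, 17.7, 18.3)

# Same data, but the largest "west" value is replaced by an extreme outlier.
west_outlier <- west
west_outlier[which.max(west)] <- 60

# Welch t-test (the default of t.test) with and without the outlier
t_clean   <- t.test(east, west)
t_outlier <- t.test(east, west_outlier)

# Wilcoxon rank sum test with and without the outlier
w_clean   <- wilcox.test(east, west, exact = FALSE)
w_outlier <- wilcox.test(east, west_outlier, exact = FALSE)

# The outlier does not change any rank, so W and its p-value are identical,
# whereas the t-test p-value changes because the mean and variance shift.
print(c(t_clean = t_clean$p.value, t_outlier = t_outlier$p.value))
print(c(w_clean = w_clean$p.value, w_outlier = w_outlier$p.value))
```

In this sketch the outlier inflates the variance of one group, which is why the t-test loses power even though the group means move further apart.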
Judging from the results of Wilcoxon's rank sum test, we can state that the difference in COPD mortality between eastern and western Japan was statistically significant (at the 0.05 level) [D] (1: in both males and females, 2: in neither males nor females, 3: in males but not in females, 4: in females but not in males).
Item | Answer |
A | data |
B | method |
C | wilcox.test |
D | 3 |
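Everyone answered A and B correctly, apart from typos such as DATA, detach, and methot. For C, a fair number of students wrongly answered pairwise.wilcox.test, but that should raise an error and fail to run. For D, several students wrongly answered 1 or 2. Running the correctly completed code produces the figure on the right and the output below.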
Results for males

	Wilcoxon rank sum test with continuity correction

data:  COPDM by AREA
W = 381.5, p-value = 0.02533
alternative hypothesis: true location shift is not equal to 0

Results for females

	Wilcoxon rank sum test with continuity correction

data:  COPDF by AREA
W = 338.5, p-value = 0.1850
alternative hypothesis: true location shift is not equal to 0
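The decision for D follows from comparing each reported p-value with the significance level. A minimal sketch, with the two p-values copied by hand from the output above (the names `alpha`, `p_values` and `significant` are only illustrative):

```r
# Significance level given in the exercise
alpha <- 0.05

# p-values copied from the wilcox.test output for males (COPDM) and females (COPDF)
p_values <- c(COPDM = 0.02533, COPDF = 0.1850)

# TRUE where the east-west difference is significant at the 0.05 level
significant <- p_values < alpha
print(significant)
```

Only COPDM falls below 0.05, which corresponds to choice 3 (in males but not in females).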