014 - False Discovery Rates, FDR, clearly explained

We'll analyze RNA sequences from mice. "wild" : "normal"

Measurements can never be perfectly exact, so each one will differ a little.

False Discovery Rates FDR Notes Screenshot.png

If we repeat this many times,

False Discovery Rates FDR Notes Feb 28.png

And each data point here corresponds to some part of this overall distribution.

Transclude of False_Discovery_Rates_FDR_Notes_Screenshot_(1
.png)

Now suppose we ran RNA-sequencing on 3 mice.

All three are close to the mean, so we can treat them as coming from the middle of the distribution.

False Discovery Rates Notes Mar 12.png

Suppose we also ran the experiment on three other mice.
Likewise, these can be treated as coming from the middle of the distribution.

False Discovery Rates FDR Notes Screenshot Mar 18.png

False Discovery Rates FDR Notes Mar 30.png

If we run a statistical test on these two samples (sample 1 & 2, three mice each), the p-value will be large.
Both group means sit near the population mean, so from the means alone it is hard to argue that the two came from different distributions.

Purely by chance, the samples could be drawn as shown below, with sample means far enough apart to look like they came from two different populations. In that case the p-value will be small. This is called a false positive (type I error): if we tested at the 5% significance level, this is one of the cases that actually falls in that 5%.

Transclude of False_Discovery_Rates_FDR_Notes_Screenshot_(1
1.png)

False Discovery Rates Explained Screenshot.png

Testing at the .05 level means about 5% of the tests where there is no real difference will be false positives.
That is usually not a large figure, but once you consider the actual number of cells involved, it adds up to quite a lot.
→ Depending on the domain, even 5% can be critical, and several methods exist to reduce it.
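The 5% figure can be checked with a small simulation. This is a minimal sketch, not the method used in the original material: it draws two samples of three from the same normal distribution 10,000 times and counts how often p < .05. A z-test with known sigma is swapped in for the statistical test (an assumption, chosen so the sketch needs only the standard library). `MU`, `SIGMA`, and `z_test_p_value` are illustrative names and values.

```python
import random
from statistics import NormalDist, mean

def z_test_p_value(sample_a, sample_b, sigma):
    """Two-sided p-value for a difference in means, assuming sigma is known."""
    n_a, n_b = len(sample_a), len(sample_b)
    standard_error = sigma * (1 / n_a + 1 / n_b) ** 0.5
    z = (mean(sample_a) - mean(sample_b)) / standard_error
    return 2 * (1 - NormalDist().cdf(abs(z)))

rng = random.Random(0)  # fixed seed so the run is repeatable
MU, SIGMA, N_MICE, N_TESTS = 10.0, 2.0, 3, 10_000  # illustrative values

null_p_values = []
for _ in range(N_TESTS):
    # Both samples come from the SAME distribution, so any "hit" is a false positive.
    sample_1 = [rng.gauss(MU, SIGMA) for _ in range(N_MICE)]
    sample_2 = [rng.gauss(MU, SIGMA) for _ in range(N_MICE)]
    null_p_values.append(z_test_p_value(sample_1, sample_2, SIGMA))

false_positives = sum(p < 0.05 for p in null_p_values)
false_positive_rate = false_positives / N_TESTS
print(f"{false_positives} of {N_TESTS} tests have p < 0.05 ({false_positive_rate:.1%})")
```

The count should land near 500, which is why 5% stops feeling small once thousands of tests are run at once.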

  • Strictly speaking, FDR is the rate itself, i.e. the proportion of false positives among the results called significant, but by convention the term is also used for the methods that reduce it.
    e.g. the Benjamini-Hochberg method (BH method)

  • Mid-point review

    If we repeat this in the same way and generate a lot of p-values..

    and plot them as a histogram,

    Transclude of False_Discovery_Rates_FDR_Notes_Screenshot_(1
    2.png)

    510 of the 10,000 p-values are below .05, i.e. significant at the .05 level.

    Each of the 20 bins holds about 5% of the p-values. (uniform distribution)

    False Discovery Rates FDR Notes July 25.png

    Transclude of False_Discovery_Rates_FDR_Notes_Screenshot_(2
    .png)

    Because the distribution is uniform, a newly generated p-value is equally likely to land in any bin. (20 bins, so roughly 5% each)

    This time, do the opposite: draw the two samples from different distributions and compute the p-values.
    (The values really will come out low, since the samples truly come from different distributions.)

    False Discovery Rates FDR Notes Sept 6.png

    Then the distribution of the p-values themselves will look like the one below. (Small p-values are more common.)

    False Discovery Rates Notes Sept 12.png

    False Discovery Rates Notes Screenshot.png

    p-value larger than the significance level: we conclude the two samples came from the same distribution. → In reality they did not (the difference is real), but we failed to reject the null hypothesis, so this is a false negative: type II error

    Transclude of False_Discovery_Rates_Notes_Screenshot_(1
    .png)
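The two histograms from the review above can be reproduced with a short simulation. This is a sketch under assumptions: a z-test with known sigma stands in for the actual test, and the means (10 vs. 14), `sigma = 2`, and the function names are hypothetical values picked for illustration.

```python
import random
from statistics import NormalDist, mean

def z_test_p_value(sample_a, sample_b, sigma):
    """Two-sided p-value for a difference in means, assuming sigma is known."""
    standard_error = sigma * (1 / len(sample_a) + 1 / len(sample_b)) ** 0.5
    z = (mean(sample_a) - mean(sample_b)) / standard_error
    return 2 * (1 - NormalDist().cdf(abs(z)))

def simulate_p_values(mu_a, mu_b, sigma, n_per_group, n_tests, rng):
    """Repeat the two-sample experiment n_tests times and collect the p-values."""
    p_values = []
    for _ in range(n_tests):
        a = [rng.gauss(mu_a, sigma) for _ in range(n_per_group)]
        b = [rng.gauss(mu_b, sigma) for _ in range(n_per_group)]
        p_values.append(z_test_p_value(a, b, sigma))
    return p_values

def bin_fractions(p_values, n_bins=20):
    """Fraction of p-values falling in each of n_bins equal-width bins on [0, 1]."""
    counts = [0] * n_bins
    for p in p_values:
        counts[min(int(p * n_bins), n_bins - 1)] += 1
    return [count / len(p_values) for count in counts]

rng = random.Random(1)  # fixed seed so the run is repeatable
same_p = simulate_p_values(10.0, 10.0, 2.0, 3, 10_000, rng)       # same distribution
different_p = simulate_p_values(10.0, 14.0, 2.0, 3, 10_000, rng)  # different distributions

same_bins = bin_fractions(same_p)            # roughly flat, about 5% per bin
different_bins = bin_fractions(different_p)  # piled up in the first bin
type_2_rate = sum(p >= 0.05 for p in different_p) / len(different_p)

print("same distribution, first 3 bins:     ", [f"{f:.1%}" for f in same_bins[:3]])
print("different distribution, first 3 bins:", [f"{f:.1%}" for f in different_bins[:3]])
print(f"false negatives (type II errors): {type_2_rate:.1%}")
```

With only three mice per group, a sizeable share of real differences still ends up above .05, which is the type II error described above.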

Now imagine an experiment to evaluate a drug's effect.
We'll look at 10,000 genes.

black: control / red: treatment group

Suppose the drug actually affected 1,000 of the genes and had no effect on the remaining 9,000.
Each group will then follow its own distribution, as shown below,

and the distribution we actually get to see will be the two combined, as shown below.

Transclude of False_Discovery_Rates_FDR_Notes_Screenshot_(1
3.png)

The overall p-value distribution:

False Discovery Rates Notes Nov 5.png

Each part of it comes from one of these two groups.
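The 9,000 + 1,000 gene setup can be simulated to see how many of the p < .05 calls are false. This is a rough sketch under assumptions: a z-test with known sigma replaces the real test, and the effect size (`shift=4.0`), `SIGMA`, and `simulate_gene` are hypothetical choices made for illustration.

```python
import random
from statistics import NormalDist, mean

SIGMA, N_PER_GROUP = 2.0, 3  # illustrative values, not from the notes

def z_test_p_value(sample_a, sample_b, sigma=SIGMA):
    """Two-sided p-value for a difference in means, assuming sigma is known."""
    standard_error = sigma * (1 / len(sample_a) + 1 / len(sample_b)) ** 0.5
    z = (mean(sample_a) - mean(sample_b)) / standard_error
    return 2 * (1 - NormalDist().cdf(abs(z)))

def simulate_gene(rng, shift):
    """p-value for one gene: control vs. treated, with the treated mean moved by shift."""
    control = [rng.gauss(10.0, SIGMA) for _ in range(N_PER_GROUP)]
    treated = [rng.gauss(10.0 + shift, SIGMA) for _ in range(N_PER_GROUP)]
    return z_test_p_value(control, treated)

rng = random.Random(2)  # fixed seed so the run is repeatable
unaffected_p = [simulate_gene(rng, shift=0.0) for _ in range(9_000)]  # drug does nothing
affected_p = [simulate_gene(rng, shift=4.0) for _ in range(1_000)]    # drug has an effect

false_positives = sum(p < 0.05 for p in unaffected_p)
true_positives = sum(p < 0.05 for p in affected_p)
observed_fdr = false_positives / (false_positives + true_positives)

print(f"p < 0.05: {true_positives} true positives, {false_positives} false positives")
print(f"share of significant results that are false: {observed_fdr:.1%}")
```

Even though each individual test has only a 5% false positive rate, the false positives make up a much larger share of the significant results, because unaffected genes outnumber affected ones 9 to 1.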

So a point like this can be used as the cutoff. (eye-ball method)

At .05, the significance level actually used, this is the part that matters.

Transclude of False_Discovery_Rates_Notes_Screenshot_(1
1.png)

Transclude of False_Discovery_Rates_FDR_Notes_Screenshot_(1
4.png)

type-1-and-type-2-errors.webp

The p-values in the .05 bin above (null hypothesis rejected) are a mix of true positives and false positives, and the way to separate them is simply to keep only the lower values.
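The Benjamini-Hochberg method named earlier is one formal way of "keeping only the lower values". Below is a minimal sketch of the standard BH adjustment (sort the p-values, scale each by m / rank, then enforce monotonicity from the largest down). The function name and the four example p-values are made up for illustration.

```python
def benjamini_hochberg(p_values):
    """Return BH-adjusted p-values, in the same order as the input."""
    m = len(p_values)
    # Indices of the p-values, sorted from smallest to largest p-value.
    order = sorted(range(m), key=lambda i: p_values[i])
    adjusted = [0.0] * m
    running_min = 1.0
    # Walk from the largest p-value down so that an adjusted value never
    # exceeds the adjusted value of any larger p-value.
    for rank in range(m, 0, -1):
        index = order[rank - 1]
        running_min = min(running_min, p_values[index] * m / rank)
        adjusted[index] = running_min
    return adjusted

example_p_values = [0.01, 0.04, 0.03, 0.20]  # hypothetical p-values
adjusted_p_values = benjamini_hochberg(example_p_values)

# Calling a result significant when its adjusted p-value is below 0.05
# targets a false discovery rate of about 5% among the significant results.
significant = [adjusted < 0.05 for adjusted in adjusted_p_values]
print(adjusted_p_values)
print(significant)
```

In this example only the smallest p-value survives the adjustment, even though three of the four raw p-values were below .05.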