J. bras. pneumol. vol. 41, no. 5


ISSN 1806-3713 © 2015 Sociedade Brasileira de Pneumologia e Tisiologia

http://dx.doi.org/10.1590/S1806-37132015000000215

J Bras Pneumol. 2015;41(5):485-485

Continuing Education: Scientific Methodology

What does the p value really mean?

Juliana Carvalho Ferreira1,3, Cecilia Maria Patino2,3

Why calculate a p value?

Consider an experiment in which 10 subjects receive a placebo, and another 10 receive an experimental diuretic. After 8 h, the average urine output in the placebo group is 769 mL, versus 814 mL in the diuretic group—a difference of 45 mL (Figure 1). How do we know if that difference means the drug works and is not just a result of chance?

of 45 mL in the average urine output between groups under the null hypothesis. Because this is a very small probability, we reject the null hypothesis. It does not mean that the drug is a diuretic, nor that there is a 97% chance of the drug being a diuretic.

Misconceptions about the p value

Clinical versus statistical significance of the effect size

There is a misconception that a very small p value means the difference between groups is highly relevant. Looking at the p value alone diverts our attention from the effect size. In our example, the p value is significant, but a drug that increases urine output by 45 mL has no clinical relevance.
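The gap between statistical and clinical significance can be illustrated with a minimal sketch. It assumes a two-sample z test with a known, equal standard deviation in both groups; the value of 50 mL for that standard deviation is hypothetical and is not taken from the article, and the z test is a simplification rather than the test used for the example. The same 45 mL difference is held fixed while the number of subjects per group changes.

```python
from math import sqrt
from statistics import NormalDist

def two_sided_p(diff, sd, n_per_group):
    """Two-sided p value from a two-sample z test with known, equal SD."""
    se = sd * sqrt(2 / n_per_group)      # standard error of the difference in means
    z = diff / se                        # standardized between-group difference
    return 2 * (1 - NormalDist().cdf(abs(z)))

ASSUMED_SD = 50  # mL; hypothetical value chosen only for illustration

# The effect size never changes; only the sample size does.
for n in (5, 10, 50, 500):
    print(f"n per group = {n:3d}, difference = 45 mL, p = {two_sided_p(45, ASSUMED_SD, n):.4f}")
```

The p value shrinks as the groups grow, yet the difference remains 45 mL throughout, so a small p value by itself says nothing about whether the effect matters clinically.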

Nonsignificant p values

Another misconception is that if the p value is greater than 5%, the new treatment has no effect. The p value indicates the probability of observing a difference as large as or larger than the one observed, under the null hypothesis. But if the new treatment has a smaller effect, a study with a small sample may be underpowered to detect it.
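As a rough sketch of what "underpowered" means, the function below approximates the power of a two-sided, two-sample z test using the normal distribution. The true effect of 20 mL and the standard deviation of 50 mL are hypothetical numbers chosen for illustration, and the normal approximation is an assumption; a t-based calculation would give somewhat lower power in small samples.

```python
from math import sqrt
from statistics import NormalDist

def approx_power(true_diff, sd, n_per_group, alpha=0.05):
    """Approximate power of a two-sided two-sample z test (known, equal SD)."""
    std_normal = NormalDist()
    z_crit = std_normal.inv_cdf(1 - alpha / 2)              # about 1.96 for alpha = 0.05
    shift = abs(true_diff) / (sd * sqrt(2 / n_per_group))   # expected value of the z statistic
    # Probability that the statistic falls beyond either critical value.
    return (1 - std_normal.cdf(z_crit - shift)) + std_normal.cdf(-z_crit - shift)

# Hypothetical scenario: a real but modest effect of 20 mL, SD of 50 mL.
for n in (10, 50, 100, 200):
    print(f"n per group = {n:3d}, power = {approx_power(20, 50, n):.2f}")
```

With few subjects per group, a real effect of this size would usually yield a nonsignificant p value, which is why such a result cannot be read as proof of no effect.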

Overinterpreting a nonsignificant p value that is close to 5%

Yet another misconception is that if the p value is close to 5%, there is a trend towards a group difference. It is inappropriate to interpret a p value of, say, 0.06, as a trend towards a difference. A p value of 0.06 means that, when the treatment has no real effect, there is a 6% probability of obtaining a result at least as extreme as the one observed by chance alone. Because we set the significance level at 5%, the null hypothesis should not be rejected.

Effect sizes versus p values

Many researchers believe that the p value is the most important number to report. However, we should focus on the effect size. Avoid reporting the p value alone; preferably report the mean values for each group, the difference, and the 95% confidence interval, followed by the p value.
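The reporting order recommended here can be sketched as follows. The individual urine outputs are invented, chosen only so that the group means equal 769 mL and 814 mL as in the example; they are not the study data. The interval assumes equal variances in the two groups (pooled standard deviation), and the critical value 2.101 is the two-sided 95% value of Student's t for 18 degrees of freedom.

```python
from math import sqrt
from statistics import mean, variance

# Hypothetical urine outputs (mL); invented values, NOT the data behind Figure 1.
placebo = [690, 720, 735, 750, 765, 770, 785, 800, 820, 855]
new_drug = [730, 760, 775, 790, 805, 815, 830, 850, 870, 915]

T_CRIT_DF18 = 2.101  # two-sided 95% critical value of Student's t, 18 degrees of freedom

def mean_difference_ci(a, b, t_crit):
    """Difference in means (b minus a) with a pooled-variance confidence interval."""
    n_a, n_b = len(a), len(b)
    diff = mean(b) - mean(a)
    pooled_var = ((n_a - 1) * variance(a) + (n_b - 1) * variance(b)) / (n_a + n_b - 2)
    se = sqrt(pooled_var * (1 / n_a + 1 / n_b))   # standard error of the difference
    return diff, diff - t_crit * se, diff + t_crit * se

diff, low, high = mean_difference_ci(placebo, new_drug, T_CRIT_DF18)
print(f"Placebo mean: {mean(placebo):.0f} mL; new drug mean: {mean(new_drug):.0f} mL")
print(f"Difference: {diff:.0f} mL (95% CI {low:.0f} to {high:.0f} mL)")
```

Reporting the interval alongside the group means lets the reader judge both the size of the effect and the precision of the estimate before looking at the p value.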

Recommended literature

1. Glantz SA. Primer of Biostatistics. 5th ed. New York: McGraw-Hill; 2002.

Figure 1. Urine output (mL) for each subject in the placebo (squares) and new drug (diamonds) groups. [Plot not reproduced: vertical axis from 600 to 1000 mL; horizontal axis categories Placebo and New drug.]

1. Divisão de Pneumologia, Instituto do Coração – InCor – Hospital das Clínicas, Faculdade de Medicina, Universidade de São Paulo, São Paulo, Brasil.

2. Department of Preventive Medicine, Keck School of Medicine, University of Southern California, Los Angeles, CA, USA.

3. Methods in Epidemiologic, Clinical and Operations Research–MECOR–program, American Thoracic Society/Asociación Latinoamericana del Tórax.

The most common way to approach this problem is to use statistical hypothesis testing. First, we state the null hypothesis of no statistical difference between the groups and the alternative hypothesis of a statistical difference. Then we select a statistical test to compute a test statistic, which is a standardized numerical measure of the between-group difference. Under the null hypothesis, we expect the test statistic value to be small, but there is a small probability that it is large, just by chance. Once we calculate the test statistic, we use it to calculate the p value.

The p value is defined as the probability of observing the given value of the test statistic, or greater, under the null hypothesis. Traditionally, the cut-off value to reject the null hypothesis is 0.05, which means that when no difference exists, such an extreme value for the test statistic is expected less than 5% of the time.
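One way to make this definition concrete is a permutation test, sketched below. It is a swapped-in technique for illustration and is not necessarily the test used in the example: the group labels are shuffled many times to mimic the null hypothesis, and the p value is estimated as the fraction of shuffles that produce a difference in means at least as large as the observed one. The individual values are invented, chosen only so that the group means equal 769 mL and 814 mL; the function name, the number of permutations, and the seed are likewise assumptions.

```python
import random

# Hypothetical urine outputs (mL); invented values, NOT the data behind Figure 1.
placebo = [690, 720, 735, 750, 765, 770, 785, 800, 820, 855]
new_drug = [730, 760, 775, 790, 805, 815, 830, 850, 870, 915]

def permutation_p_value(a, b, n_permutations=10_000, seed=0):
    """Two-sided permutation p value for the difference in group means."""
    rng = random.Random(seed)            # fixed seed so the estimate is reproducible
    n_a, n_b = len(a), len(b)
    observed = abs(sum(b) / n_b - sum(a) / n_a)
    pooled = list(a) + list(b)
    as_extreme = 0
    for _ in range(n_permutations):
        rng.shuffle(pooled)              # reassign group labels at random (null hypothesis)
        diff = abs(sum(pooled[n_a:]) / n_b - sum(pooled[:n_a]) / n_a)
        if diff >= observed:
            as_extreme += 1
    # The added one counts the observed arrangement itself, so the estimate is never zero.
    return (as_extreme + 1) / (n_permutations + 1)

p = permutation_p_value(placebo, new_drug)
print(f"Estimated p value: {p:.3f}")
print("Reject the null hypothesis" if p < 0.05 else "Do not reject the null hypothesis")
```

The result depends entirely on the invented data and should not be read as the p value of the example in the text.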
