Matches in SemOpenAlex for { <https://semopenalex.org/work/W2028842926> ?p ?o ?g. }
Showing items 1 to 63 of
63
with 100 items per page.
- W2028842926 endingPage "199" @default.
- W2028842926 startingPage "197" @default.
- W2028842926 abstract "When writing or reading articles, one should be aware whether the statistical tests performed were appropriate for the type of data collected and used, thereby avoiding misleading conclusions. The goal of all statistical tests is to determine whether two (or more) variables are associated with one another or independent from each other at the population level.In this issue of IJEM, our Clinical Research Capsule reviews the most common tests used in published literature and some of the pitfalls associated with their use. This article is intended for non-statisticians.One of the first things to keep in mind is the type of data and outcomes the author wants to measure and correlate. In order to do this, one must define the variables of the study. Are we looking at a continuous variable, one that can be quantified on an infinite scale, such as temperature or age, or is it a categorical variable, one that has to be grouped in classes. Categorical variables can be nominal or ordinal. Nominal variables are data that can be counted, but not ordered or measured. Nominal data can be further broken down into dichotomous (e.g., dead or alive) or have several categories (e.g., blood type). Ordinal data are numerical values that have a natural order and thus can be ranked and ordered. However, the distance between two values on an ordinal scale may not represent an equal degree of difference. For example, the modified Rankin score is a measure of outcome after stroke where a value of 0 is no functional deficit, and a value of 6 is dead. The difference between 0 and 1 is slight; however, the difference between a 2 and a 3 is very significant, as it distinguishes being functionally independent (2) versus dependent (3). Other examples of ordinal variables include birth order and pain severity scales.The second important thing to keep in mind is how the results are distributed. Do they follow a “bell curve” (also called a Gaussian distribution), similar to biological phenomena and exam grading techniques, or do the results tend to cluster resulting in a skewed distribution? (Fig. 1). With normally distributed data, mean and SD are reported. For skewed data, median and interquartile ranges are reported. Fig. 1Left panel demonstrates a normal distribution of mean arterial pressure (MAP) in patients with acute ischemic stroke; right panel demonstrate skewed distribution of door to computed tomography time in patients with acute ischemic strokeSometimes, it may be desirable to “normalize” skewed data. This is known as transformation. When data are skewed, the commonly applied transformations are 1/x, log(x), and sqrt(x), exponentiating, squaring, or cubing x, where the x’s are the data values.Also to be considered is whether the data are matched—meaning are sample subjects or data points related to one another, or are they independent?Once these three questions are answered, one is able to choose to the appropriate statistical test and, thus, decipher if an inappropriate test is used.For two dichotomous or binary variables, one will be able to build a 2 × 2 table. If the data follow a normal distribution, the most common test will be Chi-square test. It is used to compare the proportion of subjects in two groups, and verify the independence of each other. For example, if a study about a certain treatment obtains data that shows that it reduces mortality more than placebo for a given disease, one would like to know if the results are true or merely a coincidence. Therefore, we perform a Chi-square test and obtain the p value. One limitation of the Chi-square testing is that its distribution breaks down as the frequencies decrease. If in one of the cells of your table there are five or less observations, the data is considered skewed. In this case, you need to use Fisher’s exact test, specifically designed for small samples.For two continuous variables (e.g., respiratory rate versus age), one can use linear regression or correlation. Linear regression allows us to predict the outcome for a particular value of the predictor. Correlation help us measure the association and direction between the variables.In cases where the outcome or dependent variable (Y axis) is continuous (e.g., high blood pressure) and the independent variable (X axis) is binary (e.g., smoking yes/no); the distribution of the dependent variable will guide one in using (1) parametric tests [t test and analysis of variance (ANOVA)] for normally distributed data, or (2) nonparametric tests (Wilcoxon/Kruskal–Wallis or rank sum tests) for skewed data (Table 1). Table 1Statistical test suggestedIn cases where the outcome or dependent variable (Y axis) is binary and the independent variable (X axis) is continuous, one should use logistic regression analyses.When the data is matched (e.g., before and after measurement of a variable in the same patient), the appropriate test would be the McNemar test.There are many sources of errors when selecting a statistical test. The first involves sources of bias. These are conditions or circumstances which affect the external validity of statistical results. The second are errors in methodology, which can lead to inaccurate or invalid results. The third are interpretation of results or how statistical results are applied to real world issues.Common pitfalls: Reporting the skewed data with mean and SD. Normally distributed data should be reported with mean (average) and SD or confidence intervals, and skewed data should be reported with median and interquartile ranges.The study’s overall statistical analyses were not performed to reject the null hypothesis.If the investigator constructs a loose protocol and allows the experimenters to vary how they conduct the experimental procedures or interviews with different subjects, it is likely that the results of the experiment will be misleading.The decision not to use the data was made after inspection of the results and without a predetermined rationale.After an overall analysis had failed to reject the null hypothesis, the investigators perform a large number of new statistical tests on the data.Do not take account of changing levels of significance when many statistical tests were performed on a single set of data (for example, perform 20 comparisons with one set of data)Data omission—i.e., all the data in the analyses are not included (deleting patients with “inconvenient” results)Conclusions cannot be derived form the results of the studyReporting only p values. The mean, median, SD, confidence interval, relative risk, odds ratio, etc. should all be reported, to allow the reader to critique for him/herself the validity of the results. Look at magnitudes rather than p values.Causal inference. Observational studies are very limited in their ability to show causal relationships. We will require a multifaceted approach to the research use of chronologically structured designs (placing variables in the roles of antecedents and outcomes) and ability of replication, to come to any conclusions regarding causality.Precision and accuracy. Precision refers to how finely an estimate is specified; whereas accuracy refers to how close an estimate is to the true value. Estimates can be precise without being accurate.There seems to be an erroneous notion that you can prove anything with statistics. However, this is only true if you use them improperly. Many times the data is overlooked, or the statistical test is not correctly selected. Always keep in mind that the simpler the experiment, the better will be its execution, and the more likely will one be able to see the “real truth” and what the results actually mean." @default.
- W2028842926 created "2016-06-24" @default.
- W2028842926 creator A5012240258 @default.
- W2028842926 creator A5047756036 @default.
- W2028842926 creator A5069260976 @default.
- W2028842926 date "2008-09-01" @default.
- W2028842926 modified "2023-09-26" @default.
- W2028842926 title "Understanding statistical tests in the medical literature: which test should I use?" @default.
- W2028842926 doi "https://doi.org/10.1007/s12245-008-0061-z" @default.
- W2028842926 hasPubMedCentralId "https://www.ncbi.nlm.nih.gov/pmc/articles/2657277" @default.
- W2028842926 hasPubMedId "https://pubmed.ncbi.nlm.nih.gov/19384516" @default.
- W2028842926 hasPublicationYear "2008" @default.
- W2028842926 type Work @default.
- W2028842926 sameAs 2028842926 @default.
- W2028842926 citedByCount "11" @default.
- W2028842926 countsByYear W20288429262012 @default.
- W2028842926 countsByYear W20288429262019 @default.
- W2028842926 countsByYear W20288429262020 @default.
- W2028842926 countsByYear W20288429262021 @default.
- W2028842926 countsByYear W20288429262022 @default.
- W2028842926 crossrefType "journal-article" @default.
- W2028842926 hasAuthorship W2028842926A5012240258 @default.
- W2028842926 hasAuthorship W2028842926A5047756036 @default.
- W2028842926 hasAuthorship W2028842926A5069260976 @default.
- W2028842926 hasBestOaLocation W20288429261 @default.
- W2028842926 hasConcept C126322002 @default.
- W2028842926 hasConcept C151730666 @default.
- W2028842926 hasConcept C19527891 @default.
- W2028842926 hasConcept C2777267654 @default.
- W2028842926 hasConcept C555175668 @default.
- W2028842926 hasConcept C71924100 @default.
- W2028842926 hasConcept C86803240 @default.
- W2028842926 hasConceptScore W2028842926C126322002 @default.
- W2028842926 hasConceptScore W2028842926C151730666 @default.
- W2028842926 hasConceptScore W2028842926C19527891 @default.
- W2028842926 hasConceptScore W2028842926C2777267654 @default.
- W2028842926 hasConceptScore W2028842926C555175668 @default.
- W2028842926 hasConceptScore W2028842926C71924100 @default.
- W2028842926 hasConceptScore W2028842926C86803240 @default.
- W2028842926 hasIssue "3" @default.
- W2028842926 hasLocation W20288429261 @default.
- W2028842926 hasLocation W20288429262 @default.
- W2028842926 hasLocation W20288429263 @default.
- W2028842926 hasLocation W20288429264 @default.
- W2028842926 hasOpenAccess W2028842926 @default.
- W2028842926 hasPrimaryLocation W20288429261 @default.
- W2028842926 hasRelatedWork W1552878678 @default.
- W2028842926 hasRelatedWork W1943049635 @default.
- W2028842926 hasRelatedWork W1986554448 @default.
- W2028842926 hasRelatedWork W2030084852 @default.
- W2028842926 hasRelatedWork W2116145403 @default.
- W2028842926 hasRelatedWork W2333903716 @default.
- W2028842926 hasRelatedWork W2379006159 @default.
- W2028842926 hasRelatedWork W3137383644 @default.
- W2028842926 hasRelatedWork W4231862243 @default.
- W2028842926 hasRelatedWork W4289785777 @default.
- W2028842926 hasVolume "1" @default.
- W2028842926 isParatext "false" @default.
- W2028842926 isRetracted "false" @default.
- W2028842926 magId "2028842926" @default.
- W2028842926 workType "article" @default.