how to report outliers in results apa

(Im the poor sap you helped! Thank you so much for being one of those helpful people who posts the answers to their problems. Hey there, sorry for the delay in replying. Notice that it includes error bars representing the standard error and conforms to all the stated guidelines. Note: Depending on the requirements or the projected length of your paper, sometimes the results are combined with the discussion section. We collected all the completely (test statistic, dfs, and p value) reported t and F tests (we did not collect the results from 2 tests as these tests are often less influenced by outliers) from each article with the statcheck package for R [29]. We then lay down an appropriate nomenclature . : , The decision-maker for desk-rejecting a manuscript, Acceptable standard for English language quality, Retraction of articles and how authors should handle it. Competing interests: The authors have declared that no competing interests exist. Similarly, the bootstrap procedure as described in the method section gave a p value of .469. When there are outliers in a sample, the median and interquartile range are used to summarize a typical value and the variability in the sample, respectively. +`C11zDM-E%==@1j FnliSz:c2ZDzd%up/)#J>0Zy3Q O}L*6zQNs O8]H5K;'NA-MeN="UX40P@H$KH7jbAH?eDS?PXt. The number of reported significant results did not differ significantly between the two types of articles (non-registered; Wilcoxon rank sum test: W=3202, p=.140). There are seven main assumptions when it comes to multiple regressions and we will go through each of them in turn, as well as how to write them up in your results section. When there are no outliers in a sample, the mean and standard deviation are used to summarize a typical value and the variability in the sample, respectively. And when it comes to writing it up, again you just say what you see. MB and JMW independently rated the 34 articles and agreed on 125 (81%) of the 154 checked t tests. Line graphs are used to present correlations between quantitative variables when the independent variable has, or is organized into, a relatively small number of distinct levels. You post was very clear and helpful, much better than most of what Ive found online. He could have very easily gotten lawyers involved and most likely the website would have been shut down and we would all be out of this knowledge. Additional planned analyses also showed no significant differences between removal and non-removal of outliers at the journal level (see Table 3). If the VIF value is greater than 10, or the Tolerance is less than 0.1, then you have concerns over multicollinearity. SPSS Statistics Putting it all together Marjan Bakker, Begin the results section by restating each hypothesis, then state whether your results supported it, then give the data and statistics that allowed you to draw this conclusion. Among the low self-esteem participants, those in a negative mood expressed stronger intentions to have unprotected sex (M = 4.05, SD = 2.32) than those in a positive mood (M = 2.15, SD = 2.27). Given that many psychologists admit to excluding data to see how it impacts the results, we consider the exclusion of outliers as an indicator of the potential use of p-hacking [25] or significance chasing [26] in null hypothesis significance testing. An analysis of standard residuals was carried out on the data to identify any outliers, which indicated that participants 8 and 16 needed to be removed. If this is also true in our current sample of articles, the group of articles that did not report any exclusions of outliers might be contaminated with studies in which these values actually were removed from the analyses. So after two weeks of wading through websites, texts book and having multiple meetings with my university supervisors, I thought I would take the time to write up some instructions on how to report multiple regressions in APA format so that the next poor sap who has this issue doesnt have to waste all the time I did. This is because at very high rates of taxation, people either lose interest in working, or they start to seek ways of hiding their income from the government. I noticed that you have reproduced some images from my textbook Discovering Statistics Using SPSS without acknowledging from where they came. Nevertheless, none of our preregistered hypotheses were confirmed. document.getElementById( "ak_js_1" ).setAttribute( "value", ( new Date() ).getTime() ); Your email address will not be published. very helpful when nowhere else brings it all together like this. The differences between the raters were solved by discussion. Step 2: Identify the median, the first quartile (Q1), and the third quartile (Q3) To that end, we collected the 353 articles that contained the word outlier for all articles published between 2001 and 2010 in the following journals: Journal of Experimental Social Psychology (JESP), Cognitive Development (CD), Cognitive Psychology (CP), Journal of Applied Developmental Psychology (JADP), Journal of Experimental Cognitive Psychology (JECP), and Journal of Personality and Social Psychology (JPSP). Check the Variance box under the heading Dispersion and then click Continue. Your IP: However, I will show you how to calculate the regression and all of the important assumptions that go along with it. I did not understand this sort of analysis at all. (supposedly a biased sample leading to underestimates), 11.2% admitted that they had not fully disclosed all excluded values in their article. We included only articles that did not report removing (exclusion; see also [28]) or adapting (transforming) any values, leading to a control sample of 88 articles. << /Length 5 0 R /Filter /FlateDecode >> The exclusion of data is also one of the few QRPs that can be detected by carefully reading a published article, as the removal of outliers and other data should be mentioned in the text in accordance with common guidelines. Tilburg School of Social and Behavioral Sciences, Tilburg University, Tilburg, The Netherlands. Results show no significant difference between the two types of articles in median p value, sample sizes, or prevalence of all reporting errors, large reporting errors, and reporting errors that concerned the statistical significance. For our data at hand, quartile 1 = 811.5 and the IQR = 352.5. Although they sometimes extend one standard deviation in each direction, they are more likely to extend one standard error in each direction (as in Figure 12.12 Sample APA-Style Bar Graph, With Error Bars Representing the Standard Errors, Based on Research by Ollendick and Colleagues). it really helps, a lot. Therefore, other methods that are less influenced by possible outliers, like non-parametric or robust statistics [36], should be considered Furthermore, to prevent the (unconscious) subjective removal of outliers, an outlier handling protocol could be written down before seeing the data. Because the number of errors in an article is dependent of the number of reported statistics, we used a negative binomial regression in which we controlled for the number of reported statistics (log) to predict the number of errors (for all, large, and gross errors separately). The standard error is used because, in general, a difference between group means that is greater than two standard errors is statistically significant. These include using words only for numbers less than 10 that do not represent precise statistical results, and rounding results to two decimal places, using words (e.g., mean) in the text and symbols (e.g., . Thus our control group of articles in which the exclusion of outliers was not mentioned could also have contained articles in which outliers were indeed removed. The test-retest correlation was .96. When it comes to writing this up what you put depends on what results you got. hb```leal xB1B>s9; This will allow you to check for random normally distributed errors, homoscedasticity and linearity of data. Sexual strategies theory: A contextual evolutionary analysis of human mating. but here as I am working on 2 DVs. Of those who responded to LeBel et al. In 41% of the articles we checked, we found at least one discrepancy between sample size description and the dfs. Graphs and tables should add information rather than repeating information, be as simple as possible, and be interpretable on their own with a descriptive caption (for graphs) or a descriptive title (for tables). The same general principles apply to tables as apply to graphs. Captions in an APA manuscript should be typed on a separate page that appears at the end of the manuscript. 318 0 obj <>stream Each analysis you run should be related to your hypotheses. Each reported p value (p<.05) was recalculated based on the reported test statistic and df with the statcheck package. Future research should address the prevalence of misreporting of dfs and/or the reasons why so often the described dfs are inconsistent with the reported sample size. I also searched for hours for a comprehensive to-do list for the multiple regressions I am doing for my dissertation. Of all statistical results, 1847 (69%) were found by using the statcheck package and 820 (31%) while manually checking the results. They were interested in the relationships between working memory and several other variables. LeBel et al. broad scope, and wide readership a perfect fit for your research every time. Only for the smallest recalculated p values (<.000001) we witnessed a difference between the two distributions (Fisher-exact-test: p=.013; non registered comparison). Above all, researchers should be transparent in their articles about the exclusion of data. Go down the list and if you find any values equal or over 3.29, or less than or equal to -3.29 then that participant is an outlier and needs to be removed. The graph should be slightly wider than it is tall. I copied them from a powerpoint presentation produced by my university and as such did not know I was violating any copyright. These graphs encode five characteristics of distribution of data by showing the reader their position and length. Again, we focus here on tables for an APA-style manuscript. Performance & security by Cloudflare. Another explanation might be that articles in which nothing is reported about removing outliers, actually did involve the removal of outliers (or other data points). Given that unreported exclusion of data may not always be visible by comparing the dfs and reported sample sizes, it is quite possible that unreported exclusion of data is even more common in psychological research than our current results suggest. Buss, D. M., & Schmitt, D. P. (1993). I have found it super useful I missed information about Cooks Distance. Visit your library. To check whether this lack of reported exclusions could have influenced our results, we checked the consistency between the sample size and the reported df of t tests in articles that did not report any data removal or missingness. Now I am not going to show you how to enter the data into SPSS, if you dont know how to do that I recommend you find out first and then come back. However, there might be other explanations. Cloudflare Ray ID: 7c0cc60fec182a14 What I will do, if I can find it, is post a picture of the tables from the dissertation that I wrote this blog post about. In my experience if you do things the way the person reviewing your work does them you will probably be ok , A great post! Thank you very much! Number of articles, number articles with at least one error, number of articles with at least one large error, and number of articles with at least one gross error for each journal separately for articles in which outliers are removed and for articles that did not report any removal of outliers. Again this is more art than science and comes down to how you interpret the image. Notably, according to the APA publication manual, omitting troublesome observations from reports to present a more convincing story is [] prohibited (p. 12 [12]). Interpret and create simple APA-style graphsincluding bar graphs, line graphs, and scatterplots. When you have a small number of results to report, it is often most efficient to write them out. Click Continue and then click the Statistics button. For a more fine-grained analysis, future research could use the p curve method [25] which focuses only on the results of the main analysis. If you do then this means that your data has met the assumption of normally distributed residuals. This article has really helped me and i have to say that you are a god send for creating this! Number of statistics, number of errors, number of large errors, and number of gross errors for each journal separately for articles in which outliers were removed and for articles that did not report any removal of outliers. H|Un0+(aRr)PXl68nLJAJ.wn6TlhC1eqELG\n|GYqh>yw9,K{[m"H o5 Wicherts et al. }/"M~Ww;E{Cb b>v-&H4 - We can see from the table that the correlation between working memory and executive function, for example, was an extremely strong .96, that the correlation between working memory and vocabulary was a medium .27, and that all the measures except vocabulary tend to decline with age. You will have the opportunity to give your own interpretations of the results in the discussion section. Since power is positively related to sample size, we predicted the average sample size to be lower for articles in which outliers were removed compared to articles that reported no outlier removal. I would also highly recommend Andy Fields books but when youre in a panic state just before hand-in books are a paperweight made entirely of stress. You should expect to get a non-zero variance value, unless all your data is identical, and it just tells you that there is a difference between your IVs, https://www.youtube.com/watch?v=wc9NE7sMqjg, Your email address will not be published. Thank you so much. One must check to see if these outliers are valid data from the data source. We hypothesized that outlier exclusion would be associated with relatively high p values (below the .05 threshold), more reporting errors, and smaller sample sizes and studied this in a sample of psychology papers. qxpV/b*m Rossi [33] found 4 cases of inconsistent dfs in his sample of 46 t and F tests. The screenshot shows how to put these numbers together for trial 1. Ok, so that is all the assumptions taken care of, now we can get to actually analysing our data to see if we have found anything significant. Figure 12.14 Sample APA-Style Scatterplot. Yet, relatively little is known about how affordances provided by social media platforms affect whether and how users express political opinions. Earlier, we [14] documented that more than half of the articles in psychology that involved the use of null hypothesis significance testing contained at least one such reporting error (see also [15], [16]). Another reason for expecting this relationship is that outliers exert relatively more influence on statistical results in small samples. APA style includes several rules for presenting results in graphs and tables. None of the journals showed a significant difference between the two types of articles in terms of the number of reported significant results per article (see Table 2). However, after a certain tax rate is reached, we start to see a new effect take place wherein the tax revenue drops off as the tax rate is increased further. What you are looking for is for the dots to be on, or close, to the line running diagonally across the screen. The sharing of data for verification purposes is not common practice in psychology [1], [2] and other research fields [3][11]. Researchers often lack knowledge about how to deal with outliers when analyzing their data. Correct any data entry or measurement errors. In a results section, your goal is to report the results of the data analyses used to test your hypotheses. Respected Sir, thank you so so much for considering my query. If it looks like any of the others then one or both of the assumptions has not been met (The lines have been added to show the shape of the date, these will not appear on the actual scatterplot). Something like this: The histogram of standardised residuals indicated that the data contained approximately normally distributed errors, as did the normal P-P plot of standardised residuals, which showed points that were not completely on the line, but close. In their study, they compared a subset of the articles used in Wicherts et al. Scroll through your results until you find the box headed Residual Statistics. (The how to report is the only thing that I can not always figure out 100% from your book). We failed to find a significant difference between the articles in which outliers were removed and articles that reported no outlier removal in the median p value, number of errors, or the median sample size. You have a couple of extreme values in your dataset, so you'll use the IQR method to check whether they are outliers. ), Thanks for that, I am really glad that something I put together really just for my own information has been of so much use to people over the years . Only articles with at least one completely reported t or F test, with a reported p value smaller than .05 were included in our final sample. We had three preregistered hypotheses: (1) Insofar that researchers remove outliers to get a significant p value (p<.05), we expected the average significant p value to be higher (closer to .05) in articles in which outliers were removed than in articles that reported no removal of outliers [13], [25]. [13] used the same method as we used in the current study to compare articles of which the data were or were not shared and they found clear differences between the two types of papers. Copyright: 2014 Bakker, Wicherts. OMG ANDY your book is my long time go to bible whenever I have to write a thesis or paper or difficult statistical things. To collect a comparable sample of articles in which outliers were not removed, we randomly selected 25 articles from each journal (12 from CD) in the same timeframe (2001 till 2010). To do this, you need to identify your data analysis technique, report your test statistic, and provide some interpretation of the results. For now, click OK to run the tests. , Hi I am just writing up results for a third year stats paper: how would you report a Mahalanobis distance test that detected three outliers on two predictor variables? Number or articles per journal: (1) that mentioned outlier,(2) that were checked by Bakker & Wicherts. Step 1: Sort your data from low to high First, you'll simply sort your data in ascending order. HUr0{f@d2c;iNzbc:5xl2i8$EH v72p~ :aUH6"b ZJ~M-x]M*bx9M}KKS$K _ (If a graph presents information more clearly or efficiently, then you should keep the graph and eliminate the text or table.) There are several actions that could trigger this block including submitting a certain word or phrase, a SQL command or malformed data. x[nF}Wrmbv3 _Tb"5QrC{IFJ'u=uE,%$qOH5w6>{JgoMhm?D`/xX~|N>8&9gH? Rl#W=Vq|_@z*xmBeLSAz|}o2 -X"5. Present the results of tests in the order that you performed themreport the outcomes of main tests before post-hoc tests, for example. Decimal places and leading zeros The number of decimal places to report depends on what you're reporting. If we have any they will need to be dealt with before we can analyse the rest of the results. This was repeated 100,000 times and the W values were collected to get an empirical null distribution, with which the W value of the actual difference could be compared. Then again, Wicherts et al. These assumptions deal with outliers, collinearity of data, independent errors, random normal distribution of errors, homoscedasticity & linearity of data, and non-zero variances. We choose to focus here on t tests, as the relation between the df of the t test and the sample size is quite clear. 4/( `Tfc0@EaV-g&'l);H330a`y s\F Check your attitude before you go on the offensive Andy. You must be able to attribute a specific cause for removing outliers. The most common use of tables is to present several means and standard deviationsusually for complex research designs with multiple independent and dependent variables. Flowchart of papers in the set of papers that stated outlier removal (left) and the set of papers that did not report any removal of outliers (right). e103360. Im writing my thesis right now. Otherwise, your data has met the assumption of collinearity and can be written up something like this: Tests to see if the data met the assumption of collinearity indicated that multicollinearity was not a concern (IQ Scores, Tolerance = .96, VIF = 1.04; Extroversion, Tolerance = .96, VIF = 1.04). tv}.mYc7/l@`\FV=lF%@Y(sXf~R]Z0Wyvz3nn[S_)^% !XR;zt?Wr;Noz>qMWC@t@!\#NEBS]Vs{y6TaCopjt8I;N}y)p*g*q83k/_5mYrKrX&E'[sc5U^U^ ` s(H After the collection of the results by statcheck, we searched all articles by hand to identify and include missed reported results that were not reported in APA style (e.g., because they reported an effect size between the test statistic and the p value). Note that we did not find significant differences in median p value, number of errors, or sample size, between the articles in which we found or did not found a discrepancy, but the sample size and therefore the power for these comparisons were very low. }nt%G5VM'a8$JD'$1 In our preregistration we mention 106 articles of which outliers were removed before the actual analyses. I searched online, offline, but did not find any solution. Notice here that only half the table is filled in because the other half would have identical values. In boxplots, potential outliers are defined as follows: low potential outlier: score is more than 1.5 IQR but at most 3 IQR below quartile 1; high potential outlier: score is more than 1.5 IQR but at most 3 IQR above quartile 3. Self-esteem, mood, and intentions to use condoms: When does low self-esteem lead to risky health behaviors? There is no reference supporting the rules of thumb for VIF and tolerance so I was wondering, what work do you cite to support these rules? 293 0 obj <> endobj The median number of reported statistics was 14 (M=19.4) for articles in which outliers were removed and 12 (M=14.5) for articles that reported no outlier removal. Other ways are by displaying the number of individuals in parentheses next to the point or by making the point larger or darker in proportion to the number of individuals. I have been doing my thesis using multiple regression as techniques of data analysis really I found this post very helpful. Notice that it conforms to all the guidelines listed. 's [22] study formed a Guttman scale, gross errors can be seen as a good indicator of the use of other QRPs (including exclusion of outliers). Notice that when presented in the narrative, the terms mean and standard deviation are written out, but when presented parenthetically, the symbols M and SD are used instead. Reporting errors are discrepancies between the reported p value and the recalculated p value based on the reported test statistic and degrees of freedom (df). CD contained only 12 articles that used the term outlier in the given timeframe, and all 12 articles were examined. If you have no interest in statistics then I recommend you skip the rest of this post. Its easy to make mistakes when you start out. %Ebgqb~eF0# (`_/@BhcRn#3QET&dAYL ?eK$751SE!xyyWIn7[9s!. Such contamination might influence our results. Organizing Results Random Normally Distributed Errors & Homoscedasticity & Linearity. Analyzed the data: MB JMW. To see if the data meets the assumption of collinearity you need to locate the Coefficients table in your results. In Figure 12.13 Sample APA-Style Line Graph Based on Research by Carlson and Conard, for example, one could replace each point with a bar that reaches up to the same level and leave the error bars right where they are. The main purpose of a lab report is to demonstrate your understanding of the scientific method by performing and evaluating a hands-on lab experiment. These QRPs ranged from failing to report all of a study's dependent variables (admitted by 63%) to falsifying data (admitted by .6%). MacDonald, T. K., & Martineau, A. M. (2002). Table 1 gives the number of articles per journal. @~H{]9N^TI$-~ $rJ[4vIMh_uO When I started a Research Assistantship this semester, I found out how many things my beginning stats and research methods in psychology courses did not cover! Finally, the straight line that best fits the points in the scatterplot, which is called the regression line, can also be included. The flow chart is given in Figure 1. Another approach is to perform the analysis with and without these observations and discuss the differences. We found a proportion of reporting errors comparable with our earlier results [13], [14]. And I almost didnt click on it because the others were so disappointingthanks once again, youll be glad to know this is still making a difference almost 10 years later! Im going to deal with these three things together as all the information comes from the same place. Reports can be created with and without detected outliers so statisticians and researchers can best decide on appropriate statistical methods and properly interpret the analysis results. Im trying to write up my data analysis for my thesis and I am sorely lacking in statistics knowledge. Recent results suggest that not all exclusions of data are reported in psychological articles. In SPSS you need to click Analyse > Regression > Linear and you will get this box, or one very much like it depending on your version of SPSS, come up. %%EOF Residual (Standardised Residual) subheading. %M3b8'\6+SM`/43Ekcm>\k} Funding: The preparation of this article was supported by grant numbers 400-08-214 and 016-125-385 from the Netherlands Organization for Scientific Research (NWO). First, the graph should always add important information rather than repeat information that already appears in the text or in a table. Then, under the Standardized Residual Plots heading, tick both the Histogram box and the Normal probability plot box. However, our control sample might be contaminated due to nondisclosure of excluded values in articles that did not report exclusion of outliers. One limitation of my study is that the sample is non independent (the sample consists of couples and they need to fill in the surveys for multiple times). Thank you for bringing this to my attention, I will be more careful in future to find out the source of any images I use and give appropriate credit.