Title: Various Generalizations of Upper-Lower Index and Their Critical Values Authors: Lubomir Stepanek, Patricia Martinkova Affiliation: First Faculty of Medicine, Charles University in Prague; Institute of Computer Science, Czech Academy of Sciences Abstract: While item response theory (IRT) is the most used approach for description of item difficulty and discrimination, classical test theory indices may provide an advantage of a quick and easy-to-understand calculation using basic arithmetic. One popular index for item discrimination, the Upper-Lower Index (ULI) is defined as a difference between proportions of correct answers to an item in the upper and lower group of test takers, as defined by their total score. For ULI, practical questions arise regarding the optimal size of the upper and lower groups and regarding the critical values used to identify relatively poor items for elimination or revision (Engelhart, 1965). In this study, we revisit ULI definitions and their generalizations (Brennan, 1972). We use analytical derivations as well as simulations to get optimal group sizes and to define critical values for cases motivated by university admission tests (Martinkova et al., 2017). References: Engelhart, M. D. (1965). A comparison of several item discrimination indices. Journal of Educational Measurement, 2: 69-76. doi: 10.1111/j.1745-3984.1965.tb00393.x. Brennan, R. L. (1972). A generalized upper-lower item discrimination index. Educational and psychological measurement, 32: 289-303. doi: 10.1177/001316447203200206. Martinkova, P., Stepanek, L., Drabinova, A., Houdek, J., Vejrazka, M., Stuka, C. (2017). Semi-real-time analyses of item characteristics for medical school admission tests. 189-194. doi: 10.15439/2017F380.