item at school level • Helps schools implement best practices, leading to increased reliability and validity. • decreases test development costs compared to manual processes and makes it much easier to publish new test forms. • An item bank, a robust repository of test questions and the components that make up those questions, is critical to keeping exam content secure and fresh. • Professional test developers can create tests that accurately measure the knowledge, skills and abilities necessary for competence when there is a bank of well-written items. • Provides test developers and subject matter experts (SMEs) a set of tools to facilitate the writing, review, editing and selection of test questions. • Provides the automation, standardization, and scalability essential to developing and maintaining effective tests. • In most applications of testing and assessment, the items are of multiple choice format, but any format can be used. Items are pulled from the bank and assigned to test forms for publication either as a paper-and-pencil test or some form of e-assessment. • Item banking is the foundation for the development of valid, reliable content and defensible test forms. • Items are reusable objects; items can be used more than once. • Performance could be tracked not only within a test form, but across test forms as well. 24. Explain validity of an assessment • Validity refers to the accuracy of an assessment -- whether or not it measures what it is supposed to measure. • E.g. – the test content/ items must match the course content or unit being taught – If an assessment intends to measure achievement and ability in a particular subject area but then measures concepts that are completely unrelated, the assessment is not valid. • There are three ways in which validity can be measured. In order to have confidence that a test is valid (and therefore the inferences we make based on the test scores are valid), all three kinds of validity evidence should be considered. Type of Validity Definition Example/ Non-Example Content The extent to which the A semester or quarter content of the test exam that only includes matches the instructional content covered during objectives. the last six weeks is not a valid measure of the course's overall objectives -- it has very low content validity. Criterion The extent to which scores If the end-of-year math on the test are in tests in 4th grade correlate agreement with highly with the statewide (concurrent validity) or math tests, they would predict (predictive validity) have high concurrent an external criterion. validity Construct The extent to which an If you can correctly assessment corresponds hypothesize that ESOL to other variables, as students will perform predicted by some differently on a reading rationale or theory. test than English-speaking students (because of theory), the assessment may have construct validity.