Assessment and Evaluation in Education: July 2020
Assessment and Evaluation in Education: July 2020
Assessment and Evaluation in Education: July 2020
net/publication/342918149
CITATIONS READS
0 10,666
2 authors, including:
SEE PROFILE
Some of the authors of this publication are also working on these related projects:
All content following this page was uploaded by Tomas de Aquino Caluyua Yambi on 14 July 2020.
2018
1. Introduction
Unleashing the potential of continuous improvement in teaching/learning
requires an appreciation of the difference in spirit between assessment and evaluation.
Assessment is frequently confused and confounded with evaluation. The purpose of
an evaluation is to judge the quality of a performance or work product against a
standard. The fundamental nature of assessment is that a mentor values helping a
mentee and is willing to expend the effort to provide quality feedback that will
enhance the mentee's future performance. While both processes involve collecting
data about a performance or work product, what is done with these data in each
process is substantially different and invokes a very different mindset. This paper first
looks at what assessment is and the various aspects involving. Then attention will be
turn to evaluation and its components. Furthermore, it will look at testing as a tool
used by both assessment and evaluation, lastly some differences between assessment
and evaluation will be presented.
3. What is Assessment
Numerous terms are used to describe different types to learner assessment. Although
somewhat arbitrary, it is useful to these various terms as representing dichotomous
poles (McAlpine, 2002).
2
Formative assessment is designed to assist the learning process by providing
feedback to the learner, which can be used to identify strengths and weakness and
hence improve future performance. Formative assessment is most appropriate where
the results are to be used internally by those involved in the learning process
(students, teachers, curriculum developers). Summative assessment is used primarily
to make decisions for grading or determine readiness for progression. Typically
summative assessment occurs at the end of an educational activity and is designed to
judge the learner’s overall performance. In addition to providing the basis for grade
assignment, summative assessment is used to communicate students’ abilities to
external stakeholders, e.g., administrators and employers (Darling-Hammond, 2006).
With informal assessment, the judgments are integrated with other tasks, e.g.,
lecturer feedback on the answer to a question or preceptor feedback provided while
performing a bedside procedure. Informal assessment is most often used to provide
formative feedback. As such, it tends to be less threatening and thus less stressful to
the student. However, informal feedback is prone to high subjectivity or bias. Formal
assessment occurs when students are aware that the task that they are doing is for
assessment purposes, e.g., a written examination. Most formal assessments also are
summative in nature and thus tend to have greater motivation impact and are
associated with increased stress. Given their role in decision-making, formal
assessments should be held to higher standards of reliability and validity than
informal assessments (McAlpine 2002).
3
information, process assessment is most useful when a student is learning a new skill
and for providing formative feedback to assist in improving performance (McAlpine
2002). Product assessment focuses on evaluating the result or outcome of a process.
Using the above examples, we would focus on the answer to the math computation or
the accuracy of the blood test results. Product assessment is most appropriate for
documenting proficiency or competency in a given skill, i.e., for summative purposes.
In general, product assessments are easier to create than product assessments,
requiring only a specification of the attributes of the final product (McAlpine 2002).
After collecting data from students there is then the need for assigning
students with numbers or others symbols to a certain characteristic of the objects of
interest according to some specified rules in order to reflect quantities of properties.
This is called measurement and can be attributed to students’ achievement,
4
personality traits or attitudes. Measurement then is the process of determining a
quantitative or qualitative attribute of an individual or group of individuals that is of
academic relevance. A test will serve as the vehicle used to observe an attribute
whether in a written test or an observation or an oral question or an assessment
intended to measure the respondents' knowledge or other abilities. Then if the test is
the vehicle then the test score is the indication of what was observed through the test
and can also be quantitative and qualitative in nature.
A good test should possess not only validity and reliability but also
objectivity, objective basedness, comprehensiveness, discriminating power,
practicability, comparability and also utility (Shohamy 1993). Objectivity is when a
test is to be said objective if it is free from personal biases in interpreting its scope as
well as in scoring the responses. It can be increased by using more objective type test
items and the answers are scored according to model answers are provided. Objective
basedness is that a test should be based on pre-determined objectives. And a test setter
should have definite idea about the objective behind each item (Shohamy 1993).
Comprehensiveness is that the test should cover the whole syllabus, due importance
should be given all the relevant learning materials, and a test should cover all the
anticipated objectives. Validity is the degree to which test measures what it is to
measure. Reliability is of a test refers to the degree of consistency which it measures
what is intended to measure. A test may be reliable but need not be valid. This is
because it may yield consistent scores but these scores need not be representing what
is exactly measured what we want to measure (Shohamy 2001). Discriminating power
of the test is its power to discriminate between the upper and lower groups who took
the test. The test should have different difficulty level of questions. Practicality of the
test depends on administrative, scoring, interpretative ease and economy.
Comparability is when a test possesses comparability when scores resulting from its
use can be interpreted in terms of a common base that has a natural or accepted
meaning. Then lastly the utility, a test has utility if it provides the test condition that
would facilitate realization of the purpose for which it is mean.
Educators believe that every measurement device should possess certain
qualities. Perhaps the two most common technical concepts in measurement are
reliability and validity (Weir 2005). Any kind of assessment, whether traditional or
"authentic," must be developed in a way that gives the assessor accurate information
about the performance of the individual (Weir 2005). At one extreme, we wouldn't
have an individual paint a picture if we wanted to assess writing skills. A test high
validity has to be reliable also for the score will be consistent in both cases. A valid
test is also a reliable test, but a reliable test may not be a valid one (Shohamy 2001).
5. What is Evaluation
Here are some principles to consider for your own classroom summarised
from (Weir & Roberts, 1994; Howard & Donaghue 2015; (Kellaghan & Stufflebean
2003):
7
analysis, and reporting. It can therefore be seen as the systematic gathering of
information for purposes of decision-making, using both quantitative methods (tests)
and qualitative methods (observations, ratings and value judgments) with purpose of
judging the gathered information. In other words, when the teachers receive written
assignment from students, some kind of correction and/or response and a possible
mark will be given. Thus we are in presence of evaluation. However, assessment and
evaluation are similar in that they both involve specifying criteria and collecting
data/information. In most academic environments, they are different in purpose,
setting criteria, control of the process, and response. For example, an instructor can
use the results of a midterm exam for both assessment and evaluation purposes. The
results can be used to review with the students course material related to common
mistakes on the exam (i.e. to improve student learning as in assessment) or to decide
what measurement or grade to give each student (i.e. to judge student achievement in
the course as in evaluation) (Howard & Donaghue 2015).
1. The process of collecting, reviewing and using data, for the purpose of
improvement in the current performance, is called assessment. A process of
passing judgment, on the basis of defined criteria and evidence is called
evaluation.
2. Assessment is diagnostic in nature as it tends to identify areas of
improvement. On the other hand, evaluation is judgemental, because it aims at
providing an overall grade.
3. The assessment provides feedback on performance and ways to enhance
performance in future. As against this, evaluation ascertains whether the
standards are met or not.
4. The purpose of assessment is formative, i.e. to increase quality whereas
evaluation is all about judging quality, therefore the purpose is summative.
5. Assessment is concerned with process, while evaluation focuses on product.
6. In an assessment, the feedback is based on observation and positive &
negative points. In contrast to evaluation, in which the feedback relies on the
level of quality as per set standard.
7. In an assessment, the relationship between assessor and assessee is reflective,
i.e. the criteria are defined internally. On the contrary, the evaluator and
evaluatee share a prescriptive relationship, wherein the standards are imposed
externally.
8. The criteria for assessment are set by both the parties jointly. As opposed to
evaluation, wherein the criteria are set by the evaluator.
8. Conclusion
8
An effective, goal-oriented, teaching-learning sequence contains clearly
understood objectives, productive classroom activities, and a sufficient amount of
feedback to make students aware of the strengths and weaknesses of their
performances. Assessment and evaluation are related to both instructional objectives
and classroom learning activities and are indispensable elements in the learning
process. They are useful for gathering data/information needed into various interests.
The data can be used to make decision about the content and methods of instruction,
to make decisions about classrooms climate, to help communicate what is important,
and to assign grades. Among other techniques to do evaluation and assessment, The
teachers can use tests to evaluating and assessing, starting from the small one,
incorporating evaluation into the class routine, setting up an easy and efficient record-
keeping system, establishing an evaluation plan, and personalizing the evaluation
plan.
9. References