"Teaching without learning is just talk." (Unknown)
Any discussion about educational reform must establish a working concept of knowledge and how education serves the acquisition of knowledge.
Master said, "Yu I shall I teach you what knowledge is? When you know a thing, to recognize that you know it, and when you do not know a thing, to recognize that you do not know it. That is knowledge" (Confucius 551-479 BC)
Compare this to a modern concept of knowledge acquisition:
- Unconscious incompetence: I am ignorant and therefore without any ability.
- Conscious incompetence - I am acquainted with the knowledge but my abilities require great conscious effort with many mistakes.
- Conscious competence - I am familiar with the knowledge and my abilities are natural but still deliberate.
- Unconscious competence - I have mastered the knowledge, my abilities are fully developed and require little or no conscious management (Unknown).
In the first example, Confucius contemplates knowledge as a duality of knowing verses ignorance mediated by awareness of one's internal state in relation to this duality. The use of knowledge is implied but not central. In the second example, knowledge contains both information and action. The usefulness of knowledge is assumed and central to the concept. In contrast to Confucius, the modern concept contains a transitory conscious awareness of knowing and focuses on the processual nature of learning; true knowledge is mastery beyond deliberate thought. "Certainty" is an unspoken element of both concepts of knowledge and a key element of facilitating the acquisition of knowledge within the Connected Learning framework. Technology applied to the acquisition of knowledge is referred to as Knowledge Technology (KT).
1.1 Certainty
Knowledge provides order to our lives. On a personal and social plain, the things that we hold to be correct with certainty provide a framework on which we build predictability and develop action. Knowledge predicates control or, in the absence of control, adaptation. The notion of certainty, as a component of knowledge, has been discussed by philosophers and scientist for centuries. (e.g. Confucius, Aristotle, Auden, Russel, and McCombs). Despite the ubiquitous understanding that certainty plays a role in knowledge, it is only recently that science has attempted to investigate this domain. In the realm of education, certainty has been ignored because the tools to assess this domain did not exist. Current computer technology remedies this by providing robust data management, analysis and storage. These are the necessary elements of Certainty Based Assessment (CBA). This will be elaborated later.
1.2 True or Correct
Entire sections of libraries are devoted to discussions of truth and correctness. This epistemological discussion is not a subject we need to address. In the context of educational reform, "No Child Left Behind" (NCLB) has provided an operational framework that defines what is true or correct implicitly within the structure of "National Standards", "Expected Outcomes" and, "Units of Practice". Each subject area is defined in detail. To meet or exceed these standards is the goal of all educational reform. Connected Learning is no exception.
1.3 Unconscious Competence and Retrieval
Imagine a student in the 5th grade who must count on his or her fingers and toes to answer the question, ?What is the sum of 9 + 9?? A priori, we know that mastery has not been achieved in basic addition since one of the goals of knowledge is to quickly render fundamental facts and skills accessible. In the language of the second example of knowledge at the beginning of this section, this student has not acquired "unconscious competence". Standardized tests, such as SAT, GRE, MAT, ACT and a host of state-specific assessments, attempt to evaluate the accessibility of knowledge as a dimension of mastery by placing time restraints on the test. Unfortunately, these tests are administered at the end of a learning cycle, or outside the context of learning entirely, and the opportunity for improve ones mastery has expired.
To facilitate mastery, latency of response time is a measurable phenomenon with the aid of computer technology. There is an axiom in business that says, "You cannot manage, what you can't or don't measure". In knowledge acquisition, this axiom not only applies but is crucial. Our project contemplates and addresses the fundamental issues in knowledge acquisition by proposing technology-based methods for measuring, storing and analyzing the learning process in real time.
1.4 Assessment of Knowledge Acquisition
At the beginning of this section we quoted the statement, ?Teaching without learning is just talk.? The way we assess whether learning is in fact taking place is some sort of "testing". In the best of all possible worlds, testing should result in a realistic measurement of both teaching and learning. However, the current educational environment tends to skew measurement with the weight being placed on the learner side of the equation.
Before addressing the different types of assessment, it is instructive to delineate assessment's purposes. Kellough & Kellough (1999) state that the purpose of assessment is:
- To assist student learning.
- To identify student's strengths and weaknesses.
- To assess the effectiveness of a particular instructional strategy.
- To assess and improve the effectiveness of curriculum programs.
- To assess and improve teaching effectiveness.
- To provide data that assist in decision making
- To communicate with and involve parents.
Please note that students are mentioned only twice. Five of the seven purposes of assessment are related to the system of education that supports student learning. As a whole, this description of the purpose of assessment is another base from which to approach learner-centered principles in education as developed by Dr. Barbara McCombs and others.
While in principle there is general agreement in the education community that the focus of the education system is the academic development of individual students, actually applying learner-center principles is difficult if not impossible within the current model of education. How, for example, do you develop and execute individualized learning plans for thirty students in a classroom? The simple answer is that, in the absence of new technologies and methodologies to improve both teaching and learning, you cannot. While many other tools are required to support a learner-centered education environment, dramatically improved assessment methods are the keystone. Sections 1.5 through 1.7 describe the three domains of assessment within the Connected Learning framework.
1.5 Diagnostic Assessment
A great many tools exist to map individual abilities and differences but diagnostic test are typically used episodically such as when a teacher suspects that a student has a learning disability. Episodic diagnosis has at least three consequences. First, without baseline diagnostic information, it is not possible to empirically construct individualized learning plans. Second, problems with student learning are typically not caught until the problem becomes acute and remediation to catch up with the class becomes extraordinarily difficult. Third, marginal problems in student performance remain undiagnosed and often are explained away as a normal variation in performance that fits within the standard distribution of a class.
Diagnosis of the aptitudes, abilities and progress of each student must be unambiguous, anticipatory, timely, frequent and dynamic. Just as content changes, learners change as the result of physical, social, personal, familial and a myriad of other circumstances. Diagnostic assessment enables the education system and the student to anticipate and solve problems.
1.6 Formative Assessment
There are several extant definitions of formative assessment. Operationally, formative assessment is periodic evaluation of student performance for the purpose of improving instruction. The outcome of formative assessment can be summarized in two extremes: continuing the lesson using alternative instruction methods, examples, etc. until each student masters the material or bypassing a portion of the lesson plan if student mastery is quickly achieved. In combination with diagnostic assessment, whole lesson units may be skipped entirely or reviewed only in brief if a student already demonstrates mastery.
While it is common in many classrooms to use low-stakes assessments such as quizzes to broadly assess relative performance, low-stakes assessments mainly serve as an early warning system for later high-stakes tests such as mid-term and final examinations. Dylan William, director of the Learning and Teaching Research Center at the Educational Testing Service, states "What I mean by formative assessment is not assessment that takes place every five to six weeks, but assessment that takes place every 10 seconds." (Sausner, 2006) Rick Stiggins at the Assessment Training Institute argues that improving scores on standardized tests and learning overall depends on instructing teachers how to conduct assessments on a daily basis (Chappuis, Stiggins, Arther, & Chappuis, 2001). However, William and Black (1998) note that formative assessment is not something that can be "tacked on" to current teaching methods; ideally, teaching, learning and assessment are an integrated whole.
The power of diagnostic and formative assessment is seen in the results from North Topsail Elementary in Hampstead, North Carolina. Principal Sylvia Lewis (Sausner, 2006) was quoted as saying, "Formative assessments catapulted the Title 1 school's proficiency rating from just below 80 percent to 98.4 in a handful of years." The means to integrate formative assessment into teaching methodology currently exists. The missing pieces are user-friendly tools to easily create assessments, and capture, store and analyze the data. This is one of the goals embodied in Connected Learning.
1.7 Summative Assessment
Summative assessment is the process of evaluating student achievement after a unit of learning, e.g. quizzes, examinations, writing assignments, etc., and represents the primary way that student achievement is evaluated today. Summative assessments typically rely upon objective testing methods such as true/false, multiple-choice and matching. Other methods such as fill-in-the-blank, short answer and essay are used with less frequency. While objective testing methods are convenient and often benefit from the use of technology such as Scantron response card readers, they are subject to test taking strategies which compromise the validity of the assessment and are severely limited in what dimension of learning they evaluate. In fact, human learning is far too complex to be relegated to simple forced choice examinations. Funderstanding (n.d.) developed an alternative to current summative assessment strategies within a concept referred to as Authentic Assessment. Funderstanding claims that Authentic Assessment accomplishes the following goals:
- Requires students to develop responses rather than select from predetermined options.
- Elicits higher order thinking in addition to basic skills.
- Directly evaluates holistic projects
- Synthesizes with classroom instruction.
- Use samples of student work (portfolios) collected over an extended time period.
- Stems from clear criteria made known to students.
- Allows for the possibility of multiple human judgments.
- Relates more closely to classroom learning Teaches students to evaluate their own work? (Funderstanding, n.d.).
TCFIR has not had an opportunity to evaluate these claims. However, the goals of Authentic Assessment illustrate a set of alternatives to current summative assessment methods that should be considered if a full understanding of student development is to be accomplished. Not all of the assessment methods embodied in Authentic Assessment are amenable to automation. However, by utilizing technology where applicable, more time can be liberated to conduct assessments that require human intervention.
1.8 Certainty-Based Assessment (CBA)
In the start of Appendix B, we present confidence or certainty as a key element of knowledge. The pioneering work of A.R Garner-Medwin at University College London forms the basis for our proposals in the domain of certainty-based assessment (CBA). Simply described, CBA adds a ?certainty? dimension to technology-mediated assessment, i.e. the student is required to place a value on how confident they are in their answers. Correct answers receive extra credit based on the certainty value given to the answer. Incorrect answers are penalized based on the certainty value.
"Automated assessment suffers from two problems that are considered here. Firstly, it seldom makes use of information about how confident a student is in the answer given, which is part of what we take into account in assessing students person-to-person. Secondly, it often involves the construction of complex questions to ensure that students cannot get good marks by a combination of partial knowledge and guesswork." (Gardner-Medwin, A.R., 1995)
Through the coordinated efforts of several medical schools in London, a project was initiated under the name of the London Agreed Protocol for Teaching (LAPT) in 1994. The resulting CBA system currently contains over 10,000 questions and processes more than one million graded questions per year at the University College London alone. TCFIR offers a demonstration of CBA technology at http://tcfir.org/lapt/sys/options.cfm based on the UCL site at http://www.ucl.ac.uk/lapt. Gardner-Medwin ( 2006) states that CBA provides several distinct advantages over current assessment methods, including:
- Encouraging students to think more carefully about questions in objective tests.
- Improving student involvement and interest in the process of self-assessment.
- Helping students identify uncertainties and misconceptions in their understanding.
- Mitigating the effects of guessing on standardized tests, improving test reliability.
- Facilitating the setting of questions for students with a large range of abilities.
- Providing immediate feedback of results for teachers and learners.
- Helping students who lack self-confidence, or are overly confident.
- Facilitating remediation.
It should be noted that certainty assessment is not an assessment domain like the diagnostic, formative or summative methods. Rather, CBA is a methodology of assessment design that can be incorporated in each of the domains as appropriate.
1.9 Bringing the Elements of Assessment Together
By enriching summative assessment with certainty data, and adding diagnostic and formative assessment methods, the Connected Learning framework proposes a ?learning lifecycle? approach to assessment that addresses all seven purposes for assessment presented in Section 1.4. Starting conditions in the learning ecology are revealed through diagnostic assessment, enabling the development of lesson plans targeted to individual students. Formative assessment provides ongoing feedback during the learning process to both teacher and student, making it possible to alter the lesson plan based on student performance. Summative assessment is then transformed into the role of a confirmation of the success of the lesson plan rather than the primary means of assessing the learning process. All assessment methods provide additional diagnostic data for the next lesson plan. Testing the certainty with which students hold knowledge provides a qualitative, diagnostic dimension to that enables even more precise fine-tuning of teaching and learning. With additional technology relating student performance to particular teaching methods and materials, parents, teachers and administrators alike can evaluate the effectiveness of the learning ecology with more and better data than is produced currently. Assessment becomes the means not only for enabling significantly higher levels of student achievement but is also instrumental in the Connected Learning principle of developing a self-reforming, self-improving and self-documenting system of education.
1.10 Applying Technology-Mediated Assessment
High standards for assessment are critical in the Connected Learning framework. Based on what we have developed in this section about knowledge, and diagnostic, formative and summative assessments, let us speculate for a moment about where technology-mediated techniques can apply.
Knowledge Acquisition
- Present standards-based content to know
- Computer generated multi-media presentation in all subject areas.
- Repetitive practice in all subject areas.
- Consistent high quality.
- Assess certainty in answers
- Most easily implemented in fill-in-the-blank, matching, multiple-choice and true/false.
- Possible to implement with other test types.
- Measure actionable mastery
- Possible for all of the above by measuring and recording answer latency.
- More subtle measures of performance may be developed, e.g. relating time spent reading a section of material to formative assessment results thereby deriving overall reading traits like speed and comprehension.
General Purposes of Testing
- To assist student learning.
- Provide immediate feedback in real time
- Provide graphic representation of progress
- Record question level granularity
- Make it fun
- To identify students? strengths and weaknesses.
- Subject level granularity will clearly reveal level of success and failure
- To assess the effectiveness of a particular instructional strategy.
- Question level granularity will reveal teaching effectiveness
- Agglomeration of class data will discriminate individual or systemic failures
- To assess and improve the effectiveness of curriculum programs.
- Question level granularity will reveal teaching effectiveness
- Agglomeration of class data will discriminate individual or systemic failures
- To assess and improve teaching effectiveness.
- Granularity will reveal instructional quality
- Granularity will reveal teacher preparation
- To provide data that assist in decision making
- Provide classroom management with intricate detail
- To communicate with and involve parents.
- Graphic presentations will aid in communicating with parents
- Details based on small-scale evaluations will help parents understand the unique characteristic of their child
- Progress measurement will aid in earlier intervention when there is a problem
Summative / Authentic Assessment
- Requires students to develop responses rather than select from predetermined options
- It is possible to merge content presentation with assessment
- It is also possible to offer multiple presentation and assessment options.
- Elicits higher-order thinking in addition to basic skills
- Simulations can be created that stimulate both deductive and inductive reasoning.
- Connecting subject areas for a more elaborate and enriched understanding, e.g. how developments in timekeeping devices,
- Directly evaluates holistic projects
- Synthesizes with classroom instruction
- Tools can be created to accomplish this integration
- Uses samples of student work (portfolios) collected over an extended time period
- Student data of an objective nature can be store easily and can be used to form both a quantitative and qualitative map
- Stems from clear criteria made known to students
- Based on empirically derived data individual learning plans can be created and modified as needed.
- Progress against those plans can be accessed in real time
- Allows for the possibility of multiple human judgments
- Clever assessments can be created that test judgment
- Relates more closely to classroom learning Teaches students to evaluate their own work
- Certainty assessment require the student to self-assess
Curriculum Design
- Analysis of patterns revealed by associating assessment data with lesson plans, learning/teaching materials, teaching methodologies, etc. coupled with Internet-enabled pooling of data create opportunities to identify effective education elements.
- Proven lesson plans, i.e. those associated with high assessment results, could be accessed and used by teachers as a whole or in parts.
- Diagnostic data could be used to form ?student types?; lesson plans and curricula could then be associated as effective with certain student types.
- Data-enabled curriculum design; assessment and standards integration via technology would allow for more complex approaches to subjects while still meeting education standards.
- Integrated, multi-subject designs.
i. Theory-to-application, e.g. geometry/trigonometry and applications in navigation, construction, engineering.
ii. Connections designs, e.g. how a watch, sextant and GPS receiver can be the basis for exploring history, physics, geography, mathematics, etc.
- Bounded self-study designs.
- Collaborative designs.
Appendix B Bibliography
Black, P., William, D. (1998). Inside the Black Box: Raising Standards Through Classroom Assessment. Phi Delta Kappa International. Retrieved June 20, 2006, from http://www.pdkintl.org/kappan/kbla9810.htm.
Chappuis, S., Stiggins, R., Arther, J., & Chappuis, J. (2001). Assessment for learning: an action guide for school leaders, (2<sup>nd</sup> ed.). Portland, OR: Assessment Training Institute.
Funderstanding (n.d.). Authentic assessment. Retrieved June 28, 2006, from http://www.funderstanding.com/authentic_assessment.cfm.
Gardner-Medwwin A.R. (2006) Confidence-Based Marking - towards deeper learning and better exams. In Innovative Assessment in Higher Education, Bryan C and Clegg K (eds) Taylor and Francis, London
Gardner-Medwin, A.R. (1995). Confidence assessment in the teaching of basic science. Association for Learning Technology Journal, 3, 80-85. Retrieved June 13, 2006, from http://www.ucl.ac.uk/%7Eucgbarg/tea/altj.htm
Kellough, R.D. & N.G. Kellough (1999). Middle school teaching: A guide to methods and resources (3<sup>rd</sup> ed.). Upper Saddle River, NJ: Merrill/Prentice Hall.
Sausner, R. (2006). Making assessment work. District Administration. Retrieved June 27, 2006, from http://www.districtadministration.com/page.cfm?p=1188.
University College London (2005). Certainty-based, or confidence-based marking (CBM). Retrieved June 3, 2006, from http://www.ucl.ac.uk/lapt/index.htm.


Comments: 21
All current methods of assessment are amenable to the "formative" processes. Portfolio assessment is just another method of looking at what is acutually being learned.
In practice, few teachers are willing to self-assess in a meaningful way. For example, a test is given and the highest grade in the class is 70% and scores below that are normally distributed. Rather than accepting that the knowledge being tested has not been effectively presented, there is the temptation is to make 70 an "A" and scale the rest accordingly. Rarely will the praxis be examined. Rarely will the teacher say, "I failed to communicate the subject in an effective way." To do so, would be a formative judgement and would reflect on the teacher more than the learner.
Portfolio can reflect formative data as can any form of assessment. The difference is the mid-set the teacher takes on looking at assessment results.
We all have had teachers who have taught the same way year after year with little revision. I had a professor in graduate school who used the same lectures for 20 years. I know so because one of my other professors gave me his personal notes, from the same professor's class that he had taken years before. Clearly self-assessment was not part of this professors style.
Formative assessment is not understood by most of educators I encounter. If they understand it they are not equiped to implement it. They are far too willing to blame the learner than to take responsibility for the ineffective teaching.
This statement will not make me popular with many of my colleagues.
The tests provide massive amounts of data that could help teachers be more effective, if used formatively. I.e It is not the tests that are the problem but rather what is learned and how it is used to improve instruction.
Wouldn't that require more detailed standards than we currently have. I assume you are looking at it in the form of a checklist, of skills and knowledge that can be checked off. However, I am not certain if that is what you mean.
Theses test are linked with "accountability" and this is about measuring precisely what is actually happening in schools.
Accountability is an alien concept to most educators because it address the effectiveness of both content and methods and challenges the notions of academic freedom.
Reid, as you know all too well, under NCLB, school districts are called upon to collect data on a number of levels - including student outcomes - in an effort to improve academic performance.
Districts that utilize research-based models are making some significant gains, particularly in language arts and mathematics. Other districts, it seems, are simply throwing up their hands, especially if they don't have knowledgeable administrators who can determine if the district is on target when it comes to assessment.
About your comment on accountability (the fact that it's "alien" to most educators): That is so true!
I'm looking forward to seeing more from this group.
With NCLB, as well as education reform, the devil is in the details.
Would you agree that for standardized testing to be effective in measuring the skills of particular teachers, as some have suggested they be used, students would need to be tested at the beginning and end of each school year. In other words, students improvements would be evident each school year on an individual level. Then, you've eliminated variables such as prior teachers. Of course, it still doesn't account for things like family life during the course of the year, but it does indicate if it is a problem.
I am a scientist! I believe in empirical research. As Deb S. has said research based models have posted significant gains. The problem with all educational reform discussions there is more rhetoric than research. My team recognizing this, has proposed a framework for educational reform. see: http://tcfir.org/whitepapers/Connected%20Learning%20Framework.pdf.
It is not a plan but rather a model based on empirical research to support reform. I recommend anyone, who is interested, to read it. It is an unfinished work that is evolving daily and has been 2 years in the making.
Back to the discussion of assessment. At the beginning of every year each classroom teacher makes a dangerous assumption. That is, "The previous teachers have done their job as evidendenced simply by the fact that the class in front of me is here." Based on that assumption they proceed forward with what they believe they should teach.
In science and the scientific method, you get a baseline measurement for the sample group, you effect some change on the group, and them you measure what result has occured. Applying, this process to education you would test at the beginning of the year, you would then teach and then you would measure for outcome. Simple.
In the present system you start with what may be a faulty assuption, teach and measure the outcome at the end of the term. By leaving out the first step you have no idea what produced the result you get on your final.
Thirty years ago, in my general Psych class, I created 3 totally different final exams that tested the total body of information I wanted my students to know. I gave version 1 on the first day of class. I gave version 2 at mid-term. Then I gave the final. These exams were a combination of multiple choice, fill in the blank and short essays. The short essays were designed to show how well they synthesized the factual content of the course. I had another professor give me a copy of his final and for experimental purposes, I gave my class his exam. I had him grade his exam.
All my class skunked his best students.
I didn't teach the test. I had discussions of critical topics and let them run where they went. An unintended consequese of this system was that I had the lowest absentee rate of any instructor.
The academic Dean refused to acknowledge the success. He said, "this is not the way we handle testing."
Specifically, I am wondering if web courses will ultimately redefine our current understanding of "tests" and the organization of knowledge.
I'll admit that I skimmed your article, not willing to invest the time to really absorb it. I'm long out of schools(*s*) and have not kids but have worked with, (and sometimes FOR) the best and the worst of the last two generations...
Reading your comments is heartening. I bet they do everything they can to make you a pariah in the field.
They should not.
The and I repeat THE problem has been unsupervised unassayed and no dire consequences held for the failures of experimentation on the kids.
I am glad to see you building a method to find the culprit, and quantify the methods attempted based on empirical data.
This is not fair. I expected something edgey from a guy with your alias. You'll have to do better. I've appointed you to be my chief protagonist.
I have two girls, ages 12 and 8. My 12 year old is an honor student in the Seventh Grade, and my 8 year old is an honor student in the Third Grade, but it is because we as parents worked with them at an early age, as much as the school system they are in.
My 12 y/o says she wants to be a vet. I told her that if she continues to be at the top of her class, and applies herself, she can do anything she wants to. Positive reinforcement. She also asked for a 3500 Series Crew Cab Pickup for her 18th birthday. I told her "Validictorian (sp?) earns a used 3500. Anything less, you buy it yourself."
I agree that standardized test are no better than other tests, as well.
1. they are easy to administer
2. they are easy to grade
3. the teachers guide to the textbook usually includes them.
From a learning theory point of view, they have little or no value relating to cognitive processes (thinking). They do, however, measure the facts a student has acquired (rote memorization).
The choice is what kind of people do we want?
I like thinkers.
I forgot the first question. Have your Senator or Congress person call me. Tell your friends to join and comment. I'm trying to start a revolution.
NEA - no chance!
This is all very interesting, your article, very "technical" for a non educational person, but understandable if one takes the time.
The comments and your responses are also very useful.
As a 50+ year old college student, I see a sad lack of coherent thinking among my fellow students, despite the fact that most (far from all, but most) of the profs really try to encourage "synthesis" of information.
I think the root problem is threefold: 1) parents are too busy to give their children the encouragement, and enrichment experiences that MrEl F apparently HAS done with his daughters; 2) schools, and teachers, are forced to accommodate the lowest common denominator rather than encourage excellence; and 3) everyone has forgotten that these are human beings we're discussing -- on both sides of the podium -- not merely data points.
I applaud your efforts to put the problem into a form which can be presented to, and understood by, the politicians who (unfortunately) are expected to solve this riddle. However, until we can return to the days of accepting that every person has different interests and abilities, which effect skills mastery, we won't get far.
If you have the stomach for more dense reading, I suggest that you read our paper on educational reform.
http://tcfir.org/whitepapers/Connected%20Learning%20Framework.pdf
Appendix A deals with your final point.
Also, we are suggesting that the Internet offers the opportunity to provide content that encourages 24/7/365 learning for parents, teachers and students. With all stakeholders on the same page.
I think I provide the answer in my publication, "A Curriculum of Unquestionable Value and Lasting Relevance." It would be more appealing and maybe easier to use if it we a computer program but it does account for learning and reveal what needs to be tested on an anual basis just to check the teacher's evaluation.