    The reliability results for the ability to distinguish observation from inference are really good: we got high reliability among the six raters and the one self-rater. I totally agree with Paul’s idea that the more levels of attainment a learning goal has, the harder it is to get a high percent agreement.
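
    To make Paul's point concrete for myself, here is a minimal sketch of how percent agreement could be computed and why more levels tend to push it down. It is only an illustration under my own assumptions: the function names and the tiny rating lists are hypothetical, not our actual data, and the "chance" figure assumes two raters who pick each level independently and with equal probability, which real raters do not do.

```python
from itertools import combinations


def percent_agreement(ratings_a, ratings_b):
    """Fraction of items on which two raters chose the same level."""
    if len(ratings_a) != len(ratings_b) or not ratings_a:
        raise ValueError("both raters must rate the same non-empty set of items")
    matches = sum(a == b for a, b in zip(ratings_a, ratings_b))
    return matches / len(ratings_a)


def mean_pairwise_agreement(ratings_by_rater):
    """Average percent agreement over every pair of raters."""
    pairs = list(combinations(ratings_by_rater.values(), 2))
    if not pairs:
        raise ValueError("need at least two raters")
    return sum(percent_agreement(a, b) for a, b in pairs) / len(pairs)


def chance_agreement(num_levels):
    """Agreement expected from two raters guessing uniformly at random.

    Assumption: every level is equally likely and the raters are independent,
    so the probability that they pick the same level is 1 / num_levels.
    """
    return 1 / num_levels


# Hypothetical ratings of four items on a 4-level scale (made-up numbers).
example = {
    "rater_1": [1, 2, 3, 4],
    "rater_2": [1, 2, 3, 3],
    "rater_3": [1, 2, 2, 4],
}

if __name__ == "__main__":
    print(mean_pairwise_agreement(example))
    for levels in (2, 3, 4, 5):
        print(levels, chance_agreement(levels))
```

    Under that uniform-guessing assumption the chance baseline falls from 0.5 with two levels to 0.2 with five, so a scale with more levels of attainment starts from a lower floor and leaves more room for near-miss disagreements.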

    Therefore, I paid attention to the technical description ability, which Hunter shared with us. Overall, we got lower percent agreement on it than on the ability to distinguish observation from inference. However, I noticed that Paul, Mike and Panpan still had relatively high percent agreement on this ability. So intuitively I think at least part of this is due to professional development, because Paul, Mike and Panpan have had much more professional development in this kind of reliability evaluation than the others.

