Plannedobsolescence.net

Evolving Standards and Practices in Tenure and Promotion Reviews

2014-02-11

The following is the text of a talk I gave last week at the University of North Texas’s Academic Leadership Workshop. I’m hoping to develop this further, and so would love any thoughts or responses.

I’m happy to be here with you today, to talk a bit about evolving standards and practices in promotion and tenure reviews. Or, perhaps, about the need to place pressure on those standards and practices in order to get them to evolve. Change comes slowly to the academy, and often for good reason, but we find ourselves at a moment in which uneven development has become a bit of a problem. Some faculty practices with respect to scholarly work have in recent years changed faster than have the ways that work gets evaluated. If we don’t make a considered effort to catch our review processes up to our research and communication practices, we run the risk of stifling innovation in the places we need it most.

I want, however, to start by noting that most of what I am proposing here is intended to open a series of issues for discussion, rather than presenting a set of answers to the problems. Every university, every field, indeed, every tenure case brings different needs and expectations to the review process; it’s only in teasing out those needs and expectations that you can begin to craft a set of guidelines that will adequately represent your campus’s values and yet be supple enough to continue to represent those values in the years to come.

So first, a bit of recent history around these issues, before I move to the kinds of issues I believe we need to be considering with respect to tenure processes. In 2002, then-president Stephen Greenblatt sent a letter to the 30,000 members of the Modern Language Association, alerting them to a coming crisis in tenure review processes. The failing fiscal model under which university presses operate, he noted, was resulting in the publication of fewer and fewer scholarly monographs, a reduction being felt most acutely in the area of first books; as a result, work that was of perfectly high quality but that did not present an obvious market value was in danger of not finding a publisher. Unless departments were willing to think differently about their review processes, recognizing what the “systemic” obstacles facing all of scholarly communication, Greenblatt argued,

people who have spent years of professional training — our students, our colleagues — are at risk. Their careers are in jeopardy, and higher education stands to lose, or at least severely to damage, a generation of young scholars.

In considering what might be done, Greenblatt noted that

books are not the only way of judging scholarly achievement. Should our departments continue to insist that only books and more books will do? We could try to persuade departments and universities to change their expectations for tenure reviews: after all, these expectations are, for the most part, set by us and not by administrators. The book has only fairly recently emerged as the sine qua non and even now is not uniformly the requirement in all academic fields. We could rethink what we need to conduct responsible evaluations of junior faculty members.

There are some things that might bring one up short here: for instance, Greenblatt’s acknowledgement that the book is not “uniformly the requirement in all academic fields” of course masks the degree to which the book-based fields are outliers in the academy today. But nonetheless, those fields’ reliance on the book as the gold standard for tenure was becoming, for a host of reasons, problematic, and thus Greenblatt urged departments to reconsider their review practices, as they

can no longer routinely expect that the task of scholarly evaluation will be undertaken by the readers for university presses and that a published book will be the essential stamp of a young scholar’s authenticity and promise.

Departments, in other words, must step forward and establish means of determining for themselves where appropriate evidence of a young scholar’s “authenticity and promise” lies.

In the years following this letter, the MLA created a task force charged with examining the current state of tenure standards and practices and making recommendations for their future. That task force issued its final report in December 2006, presenting a list of 20 recommendations, supported by nearly 60 pages of data and analysis. Their recommendations included things like:

The profession as a whole should develop a more capacious conception of scholarship by rethinking the dominance of the monograph, promoting the scholarly essay, establishing multiple pathways to tenure, and using scholarly portfolios.

And:

Departments and institutions should recognize the legitimacy of scholarship produced in new media, whether by individuals or in collaboration, and create procedures for evaluating these forms of scholarship.

And:

Departments should conduct an in-depth evaluation of candidates’ dossiers for tenure and promotion at the departmental level. Presses or outside referees should not be the main arbiters in tenure cases.

And:

Departments and institutions should facilitate collaboration among scholars and evaluate it fairly.

None of these recommendations seem terribly controversial, and yet here we find ourselves. Over seven years have passed since the task force report; nearly twelve have passed since the Greenblatt letter. And by and large, standards and practices in tenure and promotion reviews have changed but little. By and large, the book remains the gold standard in what are still referred to as “book-based fields,” and departments still find themselves stymied when it comes to evaluating digital work, collaborative work, public work, and the like.

It would not be unreasonable to ask whether Stephen Greenblatt was simply being a bit alarmist in the sense he conveyed of an impending crisis for the faculty. We do not appear to be surrounded by a lost generation of scholars whose prospects were damaged by our continued adherence to the book standard in the face of university press cutbacks — and yet, given that faculty who are not awarded tenure leave our midst, and that we are not haunted by the lingering specters of unpublished books, it’s possible that the damage is nonetheless being inflicted, just in a way that we are able to keep outside our awareness. One clear, if anecdotal effect of our refusal to change, however, may well be precisely how much we have stayed the same; I have been told by several junior faculty members, and have heard the same thing at second-hand from many others, about having been counseled by senior colleagues that they should put aside their more experimental projects and focus on the traditional monograph until tenure is assured. The counselor is generally well-intentioned, wanting to help his or her junior colleague have as frictionless an experience of the review process as possible. But the outcome, too often, is that the junior faculty member is either made risk-averse or, in a positive sense, ushered into the more reliable reward channels of the ways that things are usually done, and as a result never returns to the transformative work originally imagined. And that disciplinary lockdown then gets transmitted on to the next round of junior colleagues. And so we continue, as a field, to rely on the monograph as the gold standard for tenure, and we continue to find ourselves baffled by the prospect of evaluating anything else.

This is not to say, however, that there has not been change — even in the most hide-bound of book-based fields — over the last twelve years. Scholars today are communicating with one another and making their work public in a range of ways that were only beginning to flicker into being in 2002. Many faculty maintain rich scholarly blogs, either on their own or as part of larger collectives, through which they are publishing their work; others are working on a range of small- and large-scale corpus building, datamining, mapping, and visualization projects, all of which seek to present the results of scholarly research and engagement in rich interactive formats. Projects in a wide range of digitally-inflected fields across the humanities, sciences, and social sciences are both using and developing a host of new methodologies, both for research and for the communication of the results of that research. And these projects are not just transforming their fields, but also creating a great deal of interest in scholarly work amongst the broader public.

And yet I visited a university last fall whose form for the annual professional activities report asks faculty members to list their (1) book publications, (2) peer-reviewed journal articles, (3) major conference presentations, and so on, finally getting to “web-based projects” somewhere just above volunteer service in the community. It’s just a form, of course, but in that form is inscribed the hierarchy of what we value, as evidenced by what we actually reward in our evaluation and merit review processes. And if we are going to take web-based work as seriously as traditionally published work, we need to manifest that in those reward systems.

However, I do want to be clear about something: What I am arguing here today is not that digital projects of whatever variety should be treated as the equivalent of a book or a journal article. In fact, attempting to draw those equivalences can get us into trouble, as digital work demands its own medium-specific modes of assessment. Digital projects are often radically open, both in their mode of publication and their mode of peer review; they are often process-oriented, without a clear moment of “publication” or a clear completion date; they are very frequently code-based, and often non-linear, in ways that require that they be experienced rather than simply read. And too often review processes eliminate that possibility; not only do our forms rank web-based work as unimportant, but our processes require that such work be printed out and stuck in a binder. This is clearly counter-productive; we cannot continue evaluating new kinds of work as if it has been produced and can be read just like the print-based work we’re accustomed to.

But what I’m after here is not a new set of equally rigid processes that better accommodate the particularity of the digital. Rather, our review processes need to develop a new kind of flexibility — in no small part because developing a set of criteria that perfectly deals with all of today’s forms of scholarly communication will in no way prepare you for tomorrow, or next year. The fact of the matter is that scholarly communication itself is in a period of profound change, profound enough that change itself is the only certainty. And so we need guidelines that will enable the faculty and the administration together to locate the core values that we share and to establish processes that will take each case on its own terms, while nonetheless proceeding in ways that can be fairly applied to all cases.

In considering such a transformation, I believe that we need to begin by thinking differently about what it is we’re doing in the tenure review process in the first place. We have long treated the tenure review, and to a lesser extent the review for promotion to full, as a threshold exercise: an assessment of whether the candidate has done enough to qualify. The result, I believe, is burnout and disgruntlement in the associate rank. There’s a reason, after all, why The Onion found this funny, and it’s not just about the privileges of lifetime tenure producing entitled slackers.

Assistant professors run the pre-tenure period as a race and, making it over the final hurdle, too often collapse, finding themselves exhausted, without focus or direction, depressed to discover that what is ahead of them is only more of the same. The problem is not the height of the hurdles or the length of the track; it’s the notion that the pre-tenure period should be thought of as a race at all, something with a finish line at which one will either have won or lost, but will in any case be done. I believe that we can find a better means of supporting and assessing the careers of junior faculty if we start by approaching the tenure review in a different way entirely, thinking of it not as a threshold exercise but instead as a milestone, a moment of checking in with the progress of a much longer, more sustained and sustainable career.

Here’s the thing: We hire candidates with promise, expecting that their careers will be productive over the long term, that they will engage with their material and their colleagues, and that they will come to some kind of prominence in their fields. The tenure review, at the end of the first six years of those careers, should ideally not be a moment of determining whether those candidates have thus far done X quantity of work (that is, that they have done enough to earn tenure, and can safely rest), but rather of asking whether the promise with which those candidates arrived is beginning to bear out. Let me say that again: beginning to bear out. The question we are asking, at tenure, should not be whether the full potential of a candidate has been achieved, but whether what has been done to this early point in a career gives us sufficient confidence in what will happen over the long haul that we want the candidate to remain a colleague for as long as possible. In order to figure that out, the questions we ask about the work itself should not — or at least should not only — be about its quantity; rather, we should focus on its quality. And there are a couple of different ways of thinking about and assessing that quality: first, through the careful evaluation by experts in the candidate’s field, and second, through an exploration of the evidence of the impact the candidate’s work is having in his or her field.

Such a focus on impact might help us more fairly evaluate the new kinds of digital projects that many scholars today are engaged in. But they also might encourage us to reassess a range of forms of work that have not been adequately credited in recent years. In fact, I would argue that the reforms that we need in our tenure review processes are not just about accommodating the digital at all. We also need to acknowledge and properly value forms of intellectual labor that have long been done by the faculty but that for whatever reason have gone undervalued. In my own area of the humanities, such work includes translation, or the production of scholarly editions, or the editing of scholarly journals. These are forms of work that have long been part of academic production, but that have by and large been treated as “service to the field.” And yet — just to pick up one of those examples — what more powerful position in shaping the direction of a field is there than that of the journal editor? The impact of such an editor across his or her term is likely to have a far greater and far longer-lasting influence on his or her area of study than any monograph might produce — and yet only the monograph, in most institutions, will get you promoted.

This is just one of the kinds of problems that we need to encounter. But again, I want to emphasize that it’s not enough simply to add “digital work” or “journal editing” to the list of kinds of work that we accept for tenure and promotion, not least because the impulse then is to apply current standards to those objects: are there kinds of journals that “count,” and kinds that don’t? Does the journal have to have a specified impact factor? I’m sure you can imagine more such questions — questions that I’m convinced lead us in the wrong directions, toward increasing rigidity rather than flexibility. Instead, I want to head off in a different direction. In the rest of my time this morning, I want to sketch out a few of the ways that our thinking about the review process might change in order to help produce the results we’re actually aiming for. The new ways of thinking that I’m urging today may require us to give up our reliance on some relatively easy, objective, quantitative measures, in favor of seeking out more complex, more subjective qualitative judgments — but I would suggest that these kinds of complex judgments about research in our fields are the core of our job as scholars, and that we have a particular ethical obligation to take our responsibility for such judgments seriously when they determine the future of our colleagues’ careers. This different direction will also require us to think as flexibly as we can about how our practices should not only change now, but continue to evolve as the work that junior scholars produce changes.

So, I want to float a number of principles meant to instigate some new ways of thinking about the tenure standards and processes of the future. Though these are pitched as imperatives, they are not specific practices, but rather considerations for the creation of practices. First:

(1) Do not let “but we don’t know how to evaluate this kind of work” stand as a reason not to evaluate it.

Many disciplinary organizations have been hard at work developing criteria for evaluating new kinds of scholarly work. For instance, the MLA’s Committee on Information Technology developed such a set of best practices for the evaluation of digital work in MLA fields back in 2000, and has recently updated those guidelines. The CIT has further created an evaluation wiki, which includes information such as a breakdown of types of digital work. And, perhaps most importantly, the CIT has led a series of workshops before the annual convention designed to give department and campus leaders direct experience of the kinds of questions that need to be asked about digital work, and the ways that such evaluation might proceed. In conjunction with that workshop, the CIT has produced a toolkit. And the MLA also has guidelines for the evaluation of translations, and guidelines for the evaluation of scholarly editions, among other such guidelines.

And this is just the MLA. Other scholarly organizations have done similar work on the sorts of nontraditional projects that are appearing in their own fields. And several universities have developed their own policies for how such work should be evaluated, including Emory University and the University of Nebraska at Lincoln.

So there are excellent criteria out there that can be used in evaluating many non-standard kinds of scholarly work. Review bodies, from the department level up to the university level, must familiarize themselves with those criteria and put them to use in their evaluations.

(2) Support evaluator learning.

Despite the existence of these excellent criteria for evaluating new work, however, many faculty, especially those who have long worked in exclusively traditional forms, need support in beginning to read, interpret, and engage with digital projects and other new forms of scholarly project. This need is of course what led the MLA’s committee on information technology to hold its pre-convention workshops; similar kinds of workshops have been held at the summer seminars of the Association of Departments of English and the Association of Department of Foreign Languages, and at NEH-funded summer workshops. On the local level, you might enlist the scholars on your campus who are doing digital work or other forms of nontraditional scholarship in leading similar workshops for the faculty and administrators who play key roles in the tenure review process.

(3) Engage with the work on its own terms, and in its own medium.

Supporting evaluators in the process of learning how to engage with new kinds of work is crucial precisely because the work under review must be dealt with as it is, as itself. If I could wave my magic wand and eliminate one bit of practice in tenure and promotion evaluations, it would probably be the binder. More or less every year I hear reports from scholars whose work is web-based but who have been asked to print out and three-hole-punch that work in order to have it considered as part of their dossiers. Needless to say, eliminating the interaction involved in web-based projects undermines the very thing that makes them work. As the MLA guidelines frame it, “respect medium specificity” — engage with new work in the ways its form requires.

(4) Dance with the one you brought.

In the same way that the work demands to be dealt with on its own terms, it’s crucial that tenure review processes engage with the candidates we’ve actually hired, rather than trying to transform them into someone else. While it’s tempting to advise junior scholars to take the safer road to tenure by adhering to traditional standards and practices in their work, such advice runs the risk of derailing genuinely transformative projects. Particularly when candidates have been hired into positions focused on new forms of research and teaching, or when they have been hired because of the exciting new paths they’re creating, those candidates must be supported in their experimentation. In creating that support, it’s particularly important to guard against doubling the workload on the candidate by requiring them both to complete the project and to publish about the project, or worse, to complete the project and do traditional work as well. This is a recipe for exhaustion and frustration; candidates should be encouraged to focus on the forms of their work that present the greatest promise for impact in their fields.

(5) Prepare and support junior faculty as they “mentor up.”

My emphasis on supporting the candidates that you have doesn’t mean those candidates won’t need to persuade their senior colleagues of the importance of their work. Scholars working in innovative modes and formats must be able to articulate the reasons for and the significance of their work to a range of traditional audiences — and not least, their own campus mentors. In theory, at least, this is the case for all scholars; it’s the purpose that the “personal statement” in the tenure dossier is meant to serve. For scholars working in non-traditional formats, however, there is additional need to explain the work to others, and to give them the context for understanding it. That process cannot begin with, but rather must culminate in, the personal statement. Throughout the pre-tenure period, candidates should be given opportunities to present their work to their colleagues, such that they have lots of experience explaining their work — and ample responses to their work — by the time the tenure review begins. They also need champions — mentors who, having examined the work and coming to understand its value, will help them continue to “mentor up” by arguing on behalf of that work amongst their colleagues.

(6) Use field-appropriate metrics.

Every field has its own ways of measuring impact, and the measures used in one field will not automatically translate to another. A colleague of mine whose PhD is in literature, and who began her career as a digital humanist, now holds a position that is half situated in an English department and half in an information science department. Her information science colleagues, in beginning her tenure review, calculated her h-index — and it was abysmal. The good news is that her colleagues then went on to calculate the h-indexes of the top figures in the digital humanities, and discovered that they were all equally terrible. Metrics like the h-index, or citation counts, or impact factors simply do not apply across all fields. It’s absolutely necessary that we recognize the distinctive measures of impact used in specific fields and assess work in those fields accordingly.

(7) Maybe be a little suspicious of counting as an evaluation method.

We tend to like numbers in our assessment processes. They feel concrete and objective, and some of them are demonstrably bigger than others. The problem is that we tend only to count those things that are countable, and too often, if it can’t be counted, it doesn’t count. But as qualitative social scientists — much less humanists — would insist, there is an enormous range of significant data that cannot be captured or understood quantitatively. Citation counts, for instance: such metrics can tell us how often an article has been referred to in the subsequent literature, but they can’t tell us whether the article is being praised or buried through those citations, whether it’s being built upon or whether it’s being debunked. So while I’m glad that problematic metrics like journal impact factor are gradually being replaced with a more sophisticated range of article-level metrics, I still want us to be a bit cautious about how we use those numbers. This includes web-based metrics: hits and downloads can be really affirming for scholars, but they don’t necessary indicate how closely the work is being attended to, and they aren’t comparable across fields and subfields of different sizes. If we’re going to use quantitative metrics in the review process, they need careful interpretation and analysis — and even better, should be accompanied by a range of qualitative data that captures the reception and engagement with the candidate’s work.

(8) Engage appropriate experts in the field to evaluate the work.

It is, by and large, the external reviewers that we have relied upon to produce the qualitative assessment of the tenure dossier. These experts are generally well-placed, more senior members of the candidate’s subfield who are asked to evaluate the quality of the work on its own terms, as well as the place that work has within the current discourses of the subfield. Where candidates present dossiers that include non-traditional work, however, we must seek out external reviewers who are able to evaluate not just the work’s content — as if it were the equivalent of a series of journal articles or a monograph — but also its formal aspects. These experts can and should also uncover and evaluate the specific evidence of the work’s impact within the field. In the last couple of years, a couple of colleagues and I have all had the experience of being asked to undertake a review of a tenure candidate’s digital work, and have been specifically asked by those campuses to account for the technical value of that work and the significance that it has for the field. This kind of medium-specific review is, I would argue, necessary for all forms of nontraditional work: a candidate whose dossier includes translation should have at least one qualified external reviewer asked to focus on the significance of the translation; a candidate whose dossier includes journal editing should have at least one qualified external reviewer asked to focus on the significance of that editorial work for the field.

(9) But do not overvalue the judgments of those experts.

The external reviewers that are engaged by a department or a college to assess the work of a candidate are often the best place to evaluate the quality of that work, its place within the subfield, its significance and reception, and the like. But all too often these reviewers are called upon — or take it upon themselves — to make judgments that are outside the scope of their expertise. It would be best for us to refrain from asking, or even specifically enjoin, reviewers from indicating whether a candidate’s work would merit tenure at their institution, or whether a candidate is among the “top” scholars in their field. Such comparisons rely on false equivalences among institutions and among scholars, and they are invidious at best.

Even more, departments must use the judgments of those experts to inform their own judgment, and not supplant it. Departments know the internal circumstances and values of the institution in ways that external reviewers cannot. And while the members of a departmental tenure review body might not be experts in a candidate’s specific area of interest, bringing in such experts cannot be used absolve them of responsibility for exercising their own judgments, including engaging directly with the candidate’s work themselves.

(10) Avoid (or at least beware) the false flag of “objectivity.”

The desire to externalize judgment — whether by relying upon quantitative metrics or on the assessments of external reviewers — is understandable: we want our processes to be as uncontroversial, as scrupulous, and therefore as objective as possible. And there are certain subjective judgments — such as those around questions of “collegiality” or “fit” — that should not have any place in our review processes. But aside from those issues, we must recognize that all judgment is inherently subjective. It is only by surfacing, acknowledging, and questioning our own presuppositions that we can find our way to a position that is both subjective and fair. This is a kind of work that scholars — especially those in the qualitative social sciences and the humanities — should be well equipped to do, as it’s precisely the kind of inquiry that we bring to our own subject matter. And in this line, I want to note that the external judgments that we seek from outside reviewers are no more objective than are our own. If anything, external reviewer testimony itself requires the same kind of judgment from us as does the rest of the dossier.

Moreover — and I have a whole other 45-minute talk focusing on this issue — we need to acknowledge that “peer review” is not itself an objective practice, and therefore an objective marker of quality research. And there isn’t just one appropriate way for peer review to be conducted. Many publications and projects are experimenting with modes of review that are providing richer feedback and interaction than can the standard double-blind process; it’s crucial that those new modes of review be assessed on their own merits, according to the evidence of quality work that they produce, and not dismissed as providing insufficiently objective criteria for evaluation.

(11) Reward — or at least don’t punish — collaboration.

Along those lines: I have been told by members of university promotion and tenure committees that an open peer review process, or other forms of openly commentable work, would doom a tenure candidate because anyone who participated in that process would be excluded as a potential external reviewer. The intent again is objectivity: any scholar who has had any contact with the candidate’s work, or has engaged in any communication with the candidate, or has participated in any projects with the candidate, could not possibly be “objective” enough to evaluate the work.

This is on the one hand the kind of adherence to the false flag of objectivity that I think we need to get away from, and on the other a highly destructive misunderstanding of the nature of collaboration in highly networked fields today. I understand the impulse, to ensure that the judgment provided by an external reviewer is as focused on the work as possible, without being colored by a personal relationship. But there are degrees, and we need to be able to make distinctions among them. At my own prior institution, the line was one about personal benefit: if potential external reviewers stand to gain directly in their own careers from a positive outcome in the review process — a dissertation director who becomes more highly esteemed the more highly placed his former advisees are; a co-author whose work gains greater visibility the more her partner’s career advances, and so forth — such reviewers should obviously not be engaged. But other levels of interaction should not disqualify reviewers, including co-participants in conference sessions, commenters on online projects, and so forth. We need to recognize that a key component of impact on a field is about those kinds of connection: we should want tenure candidates to be developing active relationships with other important members of their fields, to be working with them in a wide variety of ways. Such relationships should be disclosed in the review process, but they should not be used to eliminate the reviewers who might in fact be the best placed to assess the candidate’s work.

The key thing, again, is that the tenure review should be focused on assessing the impact that the candidate’s work is beginning to have on its field, and the confidence that impact to this point gives you about the importance of the work to come. Each aspect of the standards and processes that you bring to the tenure review process should be reconsidered in that light: are the measures you are using, the evaluators you are engaging, the ways the work is being read or experienced, are all of these aspects producing the best possible way of assessing a career in process, and the most responsible way of considering its future.

I want to close with one crucial question that remains, however, and it’s a big one. This process of change is huge, and wide-ranging, and it strikes at the heart of academic values. Who will lead it? I do not know the situation at your institution well enough to say that this is definitively true here, but I will say that I know of institutions where administrative initiatives to reform processes like tenure and promotion reviews are met with faculty resistance to having standards imposed on them, and yet when faculty are tasked with beginning such reform they often disbelieve that the administration will listen to them. Which is to say that among the things that has to be done in this process is creating an atmosphere of trust and collaboration between faculty and administration, such that the work can — and will — be done together.

This is not an easy. It’s not just a matter of changing a few phrases in the current guidelines to permit consideration of new kinds of work. But I firmly believe that a real investment in envisioning a new set of tenure standards and practices can have a transformative effect on our campuses, opening discussions about scholarly values, promoting innovations in both research and teaching, and supporting the new ways that scholars are connecting not just with one another but with the broader public as well. I look forward to hearing your thoughts about how such a process might go forward.