On June 25, 2009 by Jamie Madigan
I've been doing some research lately on developing employment test content from scratch, just like Grandma used to make. One of the steps in this process that's pretty widely recommended is the creation of a test blueprint (also sometimes called a test specification). Basically it's a short document that maps out what the test is about, what content areas it will tap, and some other information.
At first I glossed over this because it seemed like an example of over documentation and a likely candidate for omission from a project if you're in a hurry. As I got further closer and closer to the point of actually writing test content, though, I realized that this is really an invaluable step. Mainly because it forces you to map things out and think carefully about what your test should look like. It will also likely keep you from wasting the time of item writers --if you need twice as many vocabulary items as math, you can appropriately direct the activities of your writers.
So this post is all about creating a simple test blueprint for an employment test. It's not meant to be definitive, and if you have suggestions or pointers to other resources, please share them in the comments section.
First, though, we'll need a job. Let's just pick one totally at random, like say, Human Cannonball, which is Code # 159.347-018 in the Dictionary of Occupational Titles. Just picked that one totally at random. Honest. Now, of course, any test development activity should be preceded by a thorough job analysis, so let's assume you did that and identified these knowledge, skills, and abilities to include in your selection program:
Right. Okay. Let's look at what sections to include in your test blueprint:
Purpose of the Test
No need to get fancy. This can just be a sentence or two about how you plan to use the test. "The test will be used as a preliminary screen when selecting candidates for Human Cannonball positions at any of our regional locations."
Content Areas and Test Length
Here's the meat of it. In this section you want to break out the different content areas of the test, which will probably map directly on to the KSAs you chose to use. You may, however, want to also break out sub-areas (e.g., breaking arithmetic down into addition, subtraction, multiplication, and division).
You also want to mention the total length of the test (probably by number of questions), then break out the number of questions per content area. Now, let's assume that your job analysis included some ratings of the criticality of each of these KSAs, so that you rank order them in terms of importance. This is important, since the amount of test content for each KSA should positively correlate to its relative importance. If Spatial Orientation is twice as important as Knowledge of the English Language for Human Cannonballs, then you probably want to have twice as many test items covering it. This is particularly true if you are pursuing a content validity strategy for your test and trying to directly map the content of the test onto the content of the job.
This all can be communicated pretty effectively by a simple table, like this:

Question Formats
Here you should list the question format(s) that you will use. I found that it's even helpful to identify what formats will be used to tap what content areas, since it gives you a plan to start developing item templates and the items themselves.
Possible question formats include:
And probably a bunch more I'm forgetting. But you get the idea.
Desired Psychometric Properties
Depending on the item formats, you might skip this, but generally it's wise to think about what kinds of psychometric properties you want for your test and place a stake in the ground so you'll know when you're on target. You probably want to mention some test-level statistics here (e.g., reliability), but you should also establish your thresholds for item level statistics like difficulty and item discrimination.
Overview of Test Administration and Scoring
Finally, you want to pound out a few words about the test administration and scoring process. The information in this section should answer these questions:
So, that's it. Even something as quick and dirty as what's outlined above should help get you thinking along the right lines and should make it easier to move on to other steps in the test development process, like item writing and pilot testing. Again, if you have suggestions or pointers, put 'em in the comments!
On April 30, 2009 by Jamie Madigan

In their book, Investing in People: Financial Impact of Human Resource Initiatives authors Wayne Cascio and John Boudreau hit on something I've written about elsewhere: making research understandable and meaningful to a wider audience, especially in the context of business. In other words, putting dollar signs in there.
After some introductions and defining of terms, the authors propose what the call a "LAMP" framework for approaching the measurement of Human Resources initiatives. LAMP is an acronym for a paradigm relating to planning and couching research projects in terms those folks in Operations or Accounting will actually care about. You must have a coherent Logic for the initiative and how it connects to the larger business, the right Analytics to make sense of the data, the right Measures to gather the data in the first place, and the right Processes to make use of what you discover.
This framework established, the next chunk of the book dealt with very specific questions that I-O psychologists working in the area of Human Resources are likely to be called upon to answer. How much does employee absenteeism really hurt the company? How worried should we be about our turnover? Is it going to benefit the company to put in a new fitness center for employee use or to pay for a smoking cessation program? Is it worth it to offer on-site day care for employees to use in emergencies? How concerned should I be about these employee satisfaction survey results? Just how bad ARE the hyena attacks on the third floor?
The authors provide logic, measures, analytics, and processes for each issue to educate the reader on how to approach each question as both a scientist and a business person. Good research methods, theory building, and scientific interpretation of results are stressed, but so is communicating the outcomes in terms of dollars (or whatever your local currency may be). If you need a formula for calculating the hourly cost of turnover or absenteeism, for example, you'll find it here.
The next major part of the book dives head first into the complicated (and often controversial) concept of staffing utility. The authors provide information on measuring and using staffing utility, then its use in decision making processes for things like enhanced selection systems and HR development programs. This section of the book is not for the faint of heart as it contains some pretty complicated algebra and calls to do some pretty challenging measurement. Here, let's look at Equation 11-10 here:
![]()
I mean, sweet jeebus, what is THAT? Well, they tell you, or at least try to. And utility (pun intended) of this kind of effort can't be understated when you are trying to sell a program to key decision makers or to communicate the impact of a new program.
So in general I liked Investing in People, even if it bogs down from time to time and once or twice the reader is presented with instructions that basically amount to "just make a best guess and plug the number in to your model." But the message of how to communicate and debate with stakeholders in their own language and on their home turf is an invaluable one if Human Resources in general and I-O psychology in specific is going to move forward and become a real driving force in business.
On April 24, 2009 by Jamie Madigan

If you work for a large company that uses a lot of pre-employment tests across a large number of jobs it can sometimes be hard to keep track of all the various information associated with a test. This is especially true when new people rotate in and out of roles connected with testing (e.g., recruiters).
There are plenty of collaboration tools out there, but one solution to this kind of issue that I tried experimenting with was to create a departmental wiki that I could keep updated and point people to in order to get basic information instead of hunting around for files scattered across drives or cabinets. For those of you who don't know, a wiki (derived from a Hawaiian word for "fast") is a collection of web pages designed so that anyone with a given set of permissions can edit them. These pages are organized and linked to each other so that you can search for information or follow links from one related topic to the next.
Wikepedia.org is probably the most well-known example of a wiki, but you can take that basic concept and create one of any size and purpose. Doing a Google search for "free wiki software" or something similar will net you a ton of links, but if you want to experiment with the idea and do a quick proof of concept like I did, I'd recommend TiddlyWiki to begin with. I liked it because it's entirely client side without anything to install on web servers and works in just about any web browser and OS. You can just plunk it down on your local hard drive to get it going, then move it to a shared network drive when you're ready to share it with your team.
Using a wiki to track test information need not be complicated. To start, you could create a wiki page for each test, each with the following sections:
REMEMBER, the concept is that anyone in your group would have access to this info, which is handy but it also means you should be careful about what you put on there. You wouldn't, for example, want to use the wiki to store test results or other sensitive information.
From there, I'd encourage you to build out the wiki to include other sets of pages related to testing. Wikis can also include links to files on your intranet or shared drive, which can be an extremely useful way to organize things. There are lots of possibilities, but here are a few:
And the great thing about the above is that in line with the collaborative spirit of the whole wiki concept, anyone with access to the wiki can edit an existing page or create a new one. So the people who schedule candidates for testing can be the ones to create and maintain the wiki pages dealing with those procedures. If you get everyone on board, pretty soon you'll have a thriving, perpetually updated tool that will make things easier for everyone.
Have you had any success with a departmental wiki for testing or other issues? If so, I'd love to hear about it. Leave a message by clicking on the "Comments" link below.
On April 17, 2009 by Jamie Madigan
Sometimes when I've implemented a pre-employment testing program I've just been there to insert Tab A into Slot B in a way that keeps Lawyer C from showing up sending us all to Cell Block D (and also, hire good people in a cost efficient way). Other times, though, I've had to build a business case for implementing testing, and part of that business case is estimating how much this whole thing is going to cost.
Of course, there are one-time startup fees that vary widely depending on what you're doing --you may do it in-house, hire an expensive consulting firm, or become a member of a consortium that offers testing services. Those are the costs that usually get the most attention, but it's also important to include in any business case the cost of keeping things going once you've released it into the wild. Somebody is going to pay for testing materials, test administration, scoring, scheduling of candidates, and all that.
Given all this, I thought it would be interesting to write a little about the process I recently went through to do just this as part of developing a larger business case for a new testing program.
But first, a disclaimer: All data presented here are fictional. None of the numbers here are indicative of anything about my employer or anyone else. What I'm presenting is just a basic method and formulas populated by some fake data so that you can plug in the real thing on your own.
To tackle this, it's best to build a model based on assumptions that build on each other to calculate bottom-line costs. The starting blanks that I try to fill in are these:
It may take some research to come up with educated guesses for these, but once you have them you label them "ASSUMPTIONS" and use them to calculate the following:
Say, you know what would be handy for all this? A spreadsheet! Download it, or here's a picture:

It might take some research to get good estimates, but the idea here is that you can modify the numbers in the "Assumptions" area and see what the resulting costs are in the "Calculated Costs" area. You might also build multiple models to compare various options, like one vendor versus another or outsourcing test administration versus keeping it in-house.
Or if you're REALLY on the ball (and you are, aren't you?), you can include these costs in a formal utility analysis or break-even analysis. But even if you don't go that far any of this info would be great to include in your business case.
If you look at the spreadsheet, you'll see that most of the formulas in the "Calculated Costs" area are self-explanatory. The only thing that may need explanation are the labor cost estimates (i.e., the dollar cost for test administration, scheduling, etc.). This is something that a lot of people might overlook, but which I think is critical to include so that you get the complete picture and don't get called out it by one of your stakeholders.
First, we make some assumptions about the test administrator/scheduler for the model:
Those assumptions made, we can make some quick calculations:
Annual salary / 2,080 = Hourly labor cost (e.g., $45,000 / 2,080 = $21.64)
Annual salary x .40 = estimated annual cost of benefits (e.g., $45,000 x .40 = $18,000)
Estimated annual cost of benefits / 2,080 = Estimated hourly cost of benefits (e.g., $18,000 / 2,080 = $8.65)
Hourly labor cost + hourly cost of benefits = Total hourly compensation (e.g., $21.64 + $8.65 = $30.29).
You can then use that to calculate labor costs for your administrator's time, plus I've created a formula in that spreadsheet to calculate it automatically.
So there you have it. This is only a starting point and you may need to tweak to make it fit yoru circumstances. For example, if you have two different people doing scheduling and administration, you might break those costs out (or plug an average salary into cell C13 if they're close). Or if you don't pay your test administrators for vacations and holidays you would want to subtract those numbers from the 2,080 hours per year that they work. Or maybe you're going to outsource test administration and you need to add a line in there for that.
But this template has been a good starting point for me. A lot of it is based on assumptions and educated guesses, but even that makes a big difference when you're building your business case. Hope you enjoy, and if you have any comments, tips, or suggestions make sure to click on the "Comments" link below and let me know.
On April 14, 2009 by Jamie Madigan
I may be looking for a new co-author for the column I write for SIOP's TIP magazine starting next year. The column aims to examine the bridge between research and practice, so we want one practitioner (that's me) and one academic. So if you're an academic type in the area of Industrial-Organizational Psychology and might be interested in helping, drop me a line. The magazine comes out quarterly, so that's only four columns a year.
On April 2, 2009 by Jamie Madigan
Occasionally I get some free time at work and I get to dip into my "wouldn't it be fun to..." file. Recently I did this and pulled out the idea of something I've not had a real chance to do since that Tests and Measurements class I took in grad school: Item Analysis.
Specifically, I had data from a few hundred test-takers on a written, multiple-choice job knowledge test. No follow-up research had been done since the initial test development, and I thought it would be fun (nerd!) to see how the individual items were performing according to good old classical test theory.
Unfortunately my current employer hasn't equipped me with any statistical analysis software like SAS or SPSS --boo!-- so I set to trying to figure out how to do this with plain old Microsoft Excel. Fortunately after some time with Google I found this page, which gives you a pretty good road map as to how to do it. I made a few tweaks to the process to meet my own needs, and thought I'd share with you all here.
Let me say something first in the interest of covering myself professionally and ethically: All data I present here are fictional; no actual test data from my employer or anywhere else are reproduced here. In fact, for the sake of illustration I'm simplifying the data set to 9 test takers and 10 test items. It's easy to extrapolate to whatever number of test-takers and items you need.
First, a Brief Review
To start, in addition to basic test-level statistics (mean, standard deviation, reliability), I wanted to know item difficulty and item discrimination. Item difficulty is often measured using the p statistic and basically describes the percentage of test-takers who get that item right. Here's the formula for p:
pi = (number of people getting an item correct) / (number of people taking the test)
Item discrimination, according to this old textbook I have here, is measured with the D statistic. It reveals how good an item is at discriminating between "good" candidates and "bad" candidates. Here, good/bad is conceptualized as whether the test-taker scored in the upper or lower third of the score distribution (some other sources I've seen use top/bottom quartile --you choose). The formula for D is:
D = (U - L) / n
Where U = number of people in the top third (according to overall test score) who got the item right, L = number of people in the lower group who passed the item, and n = number of people in each group (assuming they're equal).
There are other item-level stats I could look at, like item-total correlations (a.k.a., point-biserial correlations) or distractor analyses, but I thought difficulty and discrimination were a good start. That goal identified, the challenge was now to do this in Excel.
Setting up the Spreadsheet
Following the model here I created this spreadsheet:
(Click image above to embiggen or click here to download a copy of the spreadsheet.)
The first 11 rows of the spreadsheet contain the test data for our 9 hypothetical candidates. Row 13 shows the answer key.
Rows 14 through 18 give some information about what percentage of candidates chose each response option for each question. You can start to see trouble here, as with Item 1 --everybody answerd B, which is the correct answer. But more on that shortly. Row 18 shows the totals, and if it's below 100% (like with Item 5) that means one or more candidates left that one blank. If a LOT of people leave an item blank, that's generally a big red flag.
Calculating Scores and Results
The "Results" half of the spreadsheet starts to get into the meat of things. Rows 22 through 31 show each candidate's score on each item --1 point for getting a question right, and 0 points for getting it wrong. (I guess now is a good time to point out that this model doesn't work when you give partial credit.) It's worth taking a second to discuss the formulas in this section. The formula "=IF($E$5=$E$13,1,0)" in cell E23, for example, basically says that if that Manny Calavera's answer on Item 1 (cell E5) is equal to what's in the answer key for Item 1 (cell E13), then a "1" goes into that cell. If not, a "0."
It's also noteworthy that I used the $ symbol in those cell references (e.g., $E$5 instead of just E5), which locks those references in and prevents Excel from outsmarting itself if I do something like sort the data in the "Results" section. Which I did, for a reason I'll get to in a second.
Using the Spreadsheet to Calculate Item Difficulty
So, let's get to the punchlines: the item analyses. Row 33 contains the Item Difficulty (p) for each of the 10 test items. This is just the percentage of test-takers who got each item correct, and is calculated for each item by summing up the number of points across all test-takers and then dividing by the number of candidates (9 in this example). So for Item 1 the formula is "=SUM(E23:E31)/9"
Item difficulties that approach 1 are probably too easy; ones that get close to 0 are probably too hard. Either that or there's some kind of error in your scoring key so that you're incorrectly scoring the tests.
Using the Spreadsheet to Calculate Item Discrimination
Now for Item Discrimination, which is an index of how well an item differentiates between high and low performers. To calculate this, you just subtract the p value for the bottom third of the test takers from the p value for the top third. To get these two p values, I first sorted the test takers by Score. This is why using the $ symbol to make those cell references absolute earlier was important. Without that, Excel would diligently assume you wanted it to change the formulas that score each item (e.g., in cell E23) relative to the new orders. Then you'd get garbage.
The next step in calculating D values is shown in rows 34 and 35, which recalculates those p values, but only using the top and bottom thirds of the test-takers (the first 3 and the last 3 in this case, since we have 9 people in the mix).
Row 36 then calculates the D value for each item by subtracting the upper and lower p values.
Using the Spreadsheet to Test-Level Statistics
Finally, I tossed in a section that calculated those test-level statistics: mean socre, standard deviation (SD), reliability, and Standard Error of the Mean. To calculate reliability I used the Kuder-Richardson 21 formula, which again according to this old textbook I have is:
(k/(k-1))*(1-((X-X^2/k)/10*SD^2))
This is where k is the number of items, X is the mean test score, and SD is the test's standard deviation. And when you see something like X^2 means X to the second power, or X squared. The formula's in there in the Excel spreadsheet.
Standard Error of the Mean is easier to calculate. It's just the square root of (1-X)*SD. Again, the formula's in the Excel spreadsheet.
Making Use of the Results
So, putting this all together, what can we tell? There are a number of red flags you can hunt for in the item analyses. First is seeing if items are too easy. Right off the bat item 1 looks bad. It has a p value of 1.00, which means that every single person got it right. Mabye it's too easy or maybe there's something about the way it's written that's clueing everyone in to the correct answer. Either way, it's not contributing in any meaningful way and could probably be dropped, fixed, or replaced by a better item. In general, p values above .90 should be examined closely.
Conversely, you probably want to look for items that have very low p values. Look at Item 5. It has a p value of .11, which means that almost nobody got it right. Assuming you've got the right people taking the test, this probably means that the question is either too hard or you've got some error in the body of the question or the distractors that's throwing people off. Or it's even possible that your answer key is in error. It happens.
What about item Discrimination? Generally you want values in the middle of the road here, which means that your better test-takers are getting it right while your dunces and chowderheads are getting it wrong. Very low values --especially those approaching 0-- mean that the item just isn't telling the two groups apart. What's worse are items with negative Discrimination values. That happens when for some reason the item is missed by your top test takers and gotten right by the bottom ones. Take a close look at those and either fix them, drop them, or replace them.
So that's it. It's not everything you need to know about item analysis, but it was a fun little project for me and maybe you can benefit from using my spreadsheet as a starting point if you ever do anything similar. Again, thanks to Chris Elvin of this website, from whom I borrowed heavily for everything above.
On March 9, 2009 by Jamie Madigan

No, this isn't about the video game. The full title on this one is The Halo Effect ...and the Eight Other Business Delusions that Deceive Managers. In it, author Phil Rosenzweig sets out to take the business press and best sellers to task for a list of flaws in their thinking and chest thumping. Basically, it's a list of fallacies that you could compile from the chapter titles in most books on psychology, decision-making, and behavioral economics:
I liked this book quite a bit, in part because I just like exploring these little kinks in human nature, but also because Rosenzweig fully committed himself to a no bullshit, no pulled punches critique of the silliness you see in the business press and best-selling books like Built to Last or Good to Great (which I thought was transparently terrible, too). His diatribes are replete with real-world examples, quotes, and data compilations, but also always cogent and centered around one of the delusions above (though sometimes they bleed together, as you might expect). He spends a fair amount of time splendidly savaging people like Jim Collins (of Good to Great fame and fortune), calling him on the carpet for making sure that facts, science, and sound methodology don't get in the way of telling an uplifting story. It's great to see someone with both the moxie to say stuff like this and the scientific training to substantiate his critiques. If the Journal of Applied Psychology were more like this, I'd read it cover to cover every issue.
My only substantial complaint about the book is that it's almost all criticism and has very little in the way of solutions beyond "don't fall into this faulty mode of thinking." The subtext of the book is that business performance is gosh-darn hard to measure and even harder to predict or influence. So what do you do? How DO you identify the qualities that make businesses better? Clearly, some are better than others. What are the methodologies by which we can evaluate things in the absence of truely scientific experiments? The Halo Effect isn't much help there. But at least the author criticizes the ways NOT to do it in an entertaining and enlightening way.
On March 6, 2009 by Jamie Madigan
Ha! This was just too great not to share. From the frequently awesome webcomic xkcd:

I predict we will soon start seeing this strip pop up in Statistics 101 presentations all across the world.
On March 3, 2009 by Jamie Madigan

Oh, looks like my new column went up in TIP, in which my co-author and I talk about research that bridges the gap between scientists and practitioners. My part of this issue's column talked about some really interesting stuff going in the applicant reactions area, but coming at it from a slightly different perspective. I talk about how there's some opportunities for some really interesting and novel research by looking at how stakeholders other than applicants react to selection systems. It's an idea that refuses to stop rattling around in the back of my head, and I'd like to do some research in the near future if possible.
Most practitioners would have little difficulty imagining or even recalling from memory these kinds of beliefs in action. Who hasn’t had a hiring manager come in and insist that you explain why someone who passed your test is failing miserably on the job or demand that exceptions to the testing rules be made for a candidate who they have a really good gut feel for on account of some ineffable quality or some perplexing constellation of traits? For those of us who administer selection systems in organizations, these are the kinds of battles and challenges that we face daily, and many of us have come up with a list of well-rehearsed starting points for those discussions. Moreover, Highhouse, along with many of the people responding to his article in the same issue, provides some insight and suggestions not only for combating these attacks, but also for developing research programs to examine the issue scientifically.
On February 19, 2009 by Jamie Madigan
There was a nifty article in the latest issue of Journal of Applied Psychology in which the authors investigated what happens when you tailor a job posting to an applicant. Specifically, they looked at both telling how much an applicant's profile "fit" with the requirements of the job, as well as customizing the order in which certain information about the job is presented based on the applicants' claims of what's most important to them (e.g., the nature of the work versus benefits). This was all on a Web-based job posting system mocked up to resemble something like Monster.com.
The findings, in short, were that "better" applicants applied when they were provided this kind of customization. Also, interestingly, fewer applicants threw their hats in the ring for jobs when there were customizations to the posting, which if you think about it is a very good thing if it means that you're more qualified than unqualified ones.
Given the growing sophistication of web sites and the proliferation of social networking sites, this all has some pretty interesting implications. Look at how sites like Facebook or LinkedIn encourage you to provide information about yourself and engage in activities that bulid your profile to the point where it's practically a meta game with a "% complete" score. No need to go overboard, but a corporate recruiting portal could easily ask for not only information about the candidate's qualifications, but also information about what kind of aspects of the job are most important to them --location, values, career development, salary, benefits, nature of the job, whatever. Then you could programatically tailor your recruiting message by highlighting certain aspects of the job posting and/or presenting certain information first.
I even think an index of "There is a 70% match with your application profile" would be doable, and candidates would be both more likely to pay attention to job postings with higher ratings and less likely to spam you with applications for jobs with low ratings. In the end, they apply for jobs for which they're a better match, and your company's recruiters have a smaller set of more qualified resumes to dig through.
The article, by the way, is "Effects of Customization on Applicant Decisions and Applicant Pool Characteristics in a Web-Based Recruitment Context" by Brian Dineen and Raymond Noe. It's in the 2009, Vol. 94 No. 1 issue of Journal of Applied Psychology, pp 224-234.
On December 15, 2008 by Jamie Madigan
There's an old adage along the lines of "The more you know, the more you know what you don't know." I think it might have originally come from a Peanuts strip where Charlie Brown was going through his Nietzsche phase and staring into the abyss. Be that as it may, this concept came to mind recently when I was helping my employer go through a changeover to a new version of our recruiting website. I-O Psychologists and our professional neighbors know an awful lot about recruitment, but as I tried to help define how our jobs website would work, it occurred to me that we may not know the answer to one of the most important questions, like how the heck people use job boards.
I'm thinking about basic stuff here. How do people search for jobs? By keyword, by location, by job title, by salary range, by educational requirements? What kinds of factors make people more likely to come back to a website and look for newly posted jobs, or to sign up for e-mail or text alerts? Do they search or browse? How many clicks in the application process before someone decides it's not worth it? What are the effects of different kinds of information in job postings and the presentation thereof? What are the effects of noting (or not noting) selection systems like drug screens, physicals, pre-employment test, or background checks on applicant reactions or perceptions of the company?
Companies who roll out new internet job boards have a LOT of questions about these basics, and the answers dictate how things are going to be configured and the quality of experience job-seekers will have when they come looking for new opportunities. If you do it better than your competition, you're probably going to have a real advantage in the marketplace, just like any other superior recruiting activity. On the other hand, if you do things haphazardly, only the really determined (or desperate) will make it through the gauntlet.
As I said, some of these questions could probably be answered by synthesizing information from the recruiting literature and company culture literature. We know, for example, something about how people react to drug screens or to diversity statements in job postings. That's research that can be put to use. But I think there's big chunks of the solution missing in that what we don't know about are the nuts and bolts about how people use websites like these. What do people like, dislike, want, never use, et cetera? There are no shortage of experts on web design and web usability --indeed, it's grown into an entire industry. And while I'm sure someone could offer to tell you what color palate to use, what size to make your font, and where on the page to put your logo, I'm not sure anyone has sat down to tell you the best way to get people to search for jobs that match their qualifications or how long an online application can be before casual job seekers wander off.
Some colaboration is needed here. Of course, maybe despite the considerable time I spent with Google looking into this issue, there is a body of research out there and I just don't know about it. If you know that to be the case, please let me know!
On December 9, 2008 by Jamie Madigan

I've been thinking a bit about retesting policies lately. You know, if someone takes your employment test, do you let them take it again? When? How many times? Do you poke them with a stick first?
Based on what I've seen and heard from talking to other colleagues, the only thing that people seem to agree on is that they're needed. After that, recommendations get either vague or militantly specific. There do seem to be a few things that most of the experts agree you need to keep in mind.
First, how long do you make people wait before retesting? I call this "the cooldown timer" but that's just the World of Warcraft geek in me. The main concerns here are drains on company resources (in terms of how expensive it is to give a test) and a practice effect for test-takers. If your test is a hands-on work sample that takes 6 hours to complete and can only be administered one-on-one, then you may want to keep people from retesting as often as you might if you're talking about a 40-minute, paper-based test that can be given to dozens of people at a time.
The practice effect is a thornier problem. If a person is allowed to take the same test over and over again her score may have too much to do with practice and not enough to do with the validity of the test. Problem. This may be especially true of timed tests, or with things like tests of reading comprehension where the test taker would benefit from repeated exposure to the material. The nature of the test will have to inform your decision, but generally you can combat this by either having alternate forms of your test (expensive!) and/or having them wait a month or more between attempts so that they have a chance to forget (cheap!).
On the other hand, sometimes practice isn't a bad thing. Some skills (say, data entry or physical abilities) may be expected to change with practice, and if they improve that's a GOOD thing and you may not want to discourage people. Again, the nature of the test and what constructs it measures should inform your decision.
On the third hand, there are arguably some constructs that are highly unlikely to change over time. Personality is stable by definition. General mental ability doesn't change much in adults. In these cases allowing retesting may have more to do with controlling perceptions of fairness than with validity.
The second thing you want to address is which test results are a person's "official" ones. If you just work with a pass/fail result where a person has to pass a certain cut score in order to be put in an applicant pool, this is easy. Once a person passes the test, no retesting is needed or allowed. But if you use top-down selection or banding, things get trickier. Candidates may want to retest to move to the top of the pile and better their chances of getting the job. This is going to be particularly true if you always let their highest score be their official one.
My suggestion? Always make the most recent score the one that's used for any selection decision. People may try to improve their score through retesting, but if they backslide it's just part of the risk inherent to the process. Life is like a game of Chutes and Ladders that way.
Finally, you need to consider how long test results are good for. In other words, do they curdle like milk and expire if given long enough? Again, this is something where the nature of the test is going to have to be your guide. In general, aptitude, personality, and general mental ability tests aren't going to change, but tests of physical strength, skill, or even job knowledge are susceptible to the ravages of time and may call for quicker expiration dates.
On November 17, 2008 by Jamie Madigan

Thumbing through a recent edition of the Journal of Applied Psychology I came across an article dealing with a niche of the recruiting scene that a lot of us don't often think about, but which probably has its own set of rules: recruiting volunteers. It's no surprise that when you drop that whole paycheck thing from the equation that the rules change and other factors come into play when motivating people to just give you their time and effort.
Perhaps the most surprising and possibly counter-intuitive finding of this research was that telling potential volunteers how totally mind-blowingly awesome you and your charity are may work against you. Specifically, volunteers who were told that the organization was doing a super job at completing whatever goals it had were less likely to volunteer for them, possibly because they felt their services might be put to better use elsewhere. So, don't over sell yourself.
So what does make a big difference? For one, support. Potential volunteers were interested in signing up to the extent that they thought that the organization would provide them with the support they needed to do the job. I imagine this translates to an "Am I going to be wasting my time here?" sentiment? If you're giving up your weekends or evenings, you want to feel like someone is benefitting from it instead of just sitting and saying "Tisk, tisk. Someone should really DO something."
Interesting stuff. The full title is "Volunteer Recruitment: The Role of Organizational Support and Anticipated Resepect in Non-Voluneers' Attraction to Charitable Volunteer Organizations" by Edwin J. Boezeman and Naomi Ellemers. It's in volume 93 of Journal of Applied Psychology.
On November 13, 2008 by Jamie Madigan
Here's an interesting little news brief about how the "name letter effect" can supposedly influence our choice of employer. In short, we humans seem to give preference to things that begin with the letter of our first name. And now somebody has studied this in relation to choosing an employer.
In a new study published in Psychological Science, a journal of the Association for Psychological Science, the psychologists found that there is indeed a name-letter effect between employee names and the company they work for. There were 12% more matches than was expected based on the probability estimate. The researchers noted that "hence, for about one in nine people whose initials matched their company’s initial, choice of employer seems to have been influenced by the fact that the letters matched."
I KNEW there must have been a reason I compulsively engage in resume blasts to J.C. Penny's, J.P. Morgan, and Jethro's House of Chicken & Waffles.
On November 6, 2008 by Jamie Madigan

Here's another one to file in the "Why Aren't More People Doing This?" cabinet. One of the things that Internet capitalists have figured out is that people want to use the Web to meet people. You've got your fan sites and social networking sites like LinkedIn and Facebook, but I'm thinking here more along the lines of dating sites like match.com or eharmony.com, which facilitate your meeting potential partners for everything from a long-term romance to, well, you know... I've never had occasion to use these kinds of dating sites, but it's not hard to find stories --even ones told first-hand-- about people who have experienced great success with them. And I have used other websites of a slightly different bent, like meetup.com, to find groups of people interested in getting together to participate in our shared hobbies like photography and gaming.
Bells and whistles aside, at their core what these sites do is ask you about what you like and what you're interested in, then they show you people who have matching or complementary interests who you might like to meet, then they facilitate your getting together. This begs the question of why academic researchers and practitioners aren't doing this to seek out collaboration opportunities, especially those working in the area of Industrial-Organizational psychology.
This is actually an idea that a guy by the name of Alan Walker mentioned in an issue of Journal of Occupational and Organizational Psychology a while back, and it's been rattling around in the back of my head, given how frequently I'm called upon to think of the scientist/practitioner model. Researchers (including graduate students, especially graduate students) need real-world data to test their theories and get their publications. There's only so much you can accomplish by offering college sophomores extra credit to participate in yet another lab study. Practitioners have the data, or at least access to it. Practitioners also have problems that the researchers can help with. How do I reduce turnover in my call center? How do I best select people for my line crews? What sort of executive education curriculum would work best for my industry? Would adding biodata questions to my online applications help me select better candidates?
Yet researchers and practitioners are so often like ships passing in the night. They each WANT to hook up and play a few rounds of "show me your correlation coefficient" if you know what I mean, but it's a big world and unless you really know how to network you're just whistling in the dark. So wouldn't it be great if there were a website to play matchmaker? Say you were a researcher with a list of interests and you could go onto a site and see a list of decision-makers in organizations that have problems that line up with those interests? Or even just one that would be willing to let you include a few experimental items for that scale you're working on in exchange for measuring some other stuff while you're at it? Or what if you were a graduate student in need of data for your dissertation on a certain job taxonomy and you'd be willing to conduct job analysis as long as you could use the data for your own research?
Or heck, I'm a practitioner who would love to do more research but just doesn't have the time and hasn't kept on the bleeding edge of research like people whose job it is to do just that. But I've got access to some data, some applicants, some statistics and wouldn't mind working with someone to get a publication or presentation out of that if the circumstances were right. Or maybe two researchers working at different institutions want to get together to collaborate.
Granted, there are practical problems (reliability and timeliness and ownership of data come immediately to my mind), but there could be a lot of missed opportunities here as well. This is a niche that organizations like SIOP, SHRM, or the Academy of Management could really do us a service to fill.