Thanks to all for your comments, especialliy for pointing out that either evaluation approach is permissible. Just to close the loop, the decision was made (for me) to go with 2 sample task orders, the rationale being a more level playing field and a more streamlined evaluation process (2 sample vs. 10 real orders)