This repo contains the scripts to do stat over clueweb dataset (for subfeature project), and preprocess the data, and train the dataset.
prepare_ranklib_input.sh
: preprocess the raw data given by Michaeltrain.sh
: ranklib train (e.g. LambdaMART)show_map.sh
andshow_ndcg.sh
: show training and testing results