High-throughput subset matching on commodity GPU-based systems