Check out the section on "character classes" in perlre (their marked with braces, [ ]), as these will allow you to match hyphens and numbers as well. Also you could do that push(...) while ... as @found_items = $data =~ m#\s+((\w+\s*)+)</td>#g;.