Programming Challenge Description: Crawl an HTML page to extract file names containing certain patterns from hyperlinks Given a one line HTML "page", find all hyperlinks (e.g., URLs specified within and not as plain text) and return all the names of zip files that contain word "data" and come from www.example.com web site. The output should be a comma-separated list without spaces. If no such file was detected, print empty line (e.g., “”). Example: Input: some text Some other text some more text Output: global_data.zip,local_data.zip Input: Your program should read lines of text from standard input. Each line may or may not contain one or more target files. Output: Print to standard output a single line containing a comma-separated list of such file names without spaces. If no such file was detected, print empty line (e.g., “”). Test 1 Test Input World Data Part 1
World Data Part 2 Expected Output world_data1.zip,world_data2.zip Test 2 Test Input

Question

IBM