Foundations of data management class
Using the provided data set, coffee_shop_sales_2023.csv, write a Hadoop MapReduce
program that computes the answer to the following SQL query.
SELECT store_location, SUM(transaction_qty * unit_price)
FROM coffee_shop_sales
WHERE product_category = 'Coffee'
GROUP BY store_location
HAVING COUNT(*) >= 20000
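The query can be read as a map step followed by a reduce step. The sketch below is plain Java with no Hadoop classes, so it is not the SQL2MR.java template, only an illustration of the logic. The class name Sql2MrSketch, the method names, and the 0-based column positions are assumptions made for illustration; check the positions against the actual CSV file. It also assumes that no field contains a comma. Note that COUNT(*) in the HAVING clause counts only the rows that pass the WHERE filter.

```java
import java.util.AbstractMap;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

class Sql2MrSketch {
    // Assumed 0-based column positions; verify against the real CSV file.
    static final int QTY = 3, LOCATION = 5, PRICE = 7, CATEGORY = 8;

    // Map step: WHERE filter plus projection.
    // Returns (store_location, transaction_qty * unit_price), or null if the row is filtered out.
    static Map.Entry<String, Double> map(String line) {
        String[] f = line.split(",");  // naive split: assumes no quoted commas
        if (f.length <= CATEGORY || !f[CATEGORY].equals("Coffee")) {
            return null;
        }
        double amount = Integer.parseInt(f[QTY]) * Double.parseDouble(f[PRICE]);
        return new AbstractMap.SimpleEntry<>(f[LOCATION], amount);
    }

    // Reduce step: SUM over one group, with the HAVING check on the group's row count.
    // Returns null if the group has fewer than minCount rows.
    static Double reduce(List<Double> values, int minCount) {
        if (values.size() < minCount) {
            return null;
        }
        double sum = 0.0;
        for (double v : values) {
            sum += v;
        }
        return sum;
    }

    // Stand-in for the shuffle: group the mapped values by key, then reduce each group.
    static Map<String, Double> run(List<String> lines, int minCount) {
        Map<String, List<Double>> groups = new TreeMap<>();
        for (String line : lines) {
            Map.Entry<String, Double> kv = map(line);
            if (kv != null) {
                groups.computeIfAbsent(kv.getKey(), k -> new ArrayList<>()).add(kv.getValue());
            }
        }
        Map<String, Double> out = new TreeMap<>();
        for (Map.Entry<String, List<Double>> g : groups.entrySet()) {
            Double total = reduce(g.getValue(), minCount);
            if (total != null) {
                out.put(g.getKey(), total);
            }
        }
        return out;
    }

    public static void main(String[] args) {
        // Made-up rows for illustration only; they are not taken from the data set.
        List<String> rows = Arrays.asList(
            "1,2023-01-01,07:06:11,2,5,Astoria,32,3.0,Coffee,a,b",
            "2,2023-01-01,07:08:56,1,5,Astoria,57,2.5,Coffee,a,b",
            "3,2023-01-01,07:14:04,1,5,Astoria,59,4.0,Tea,a,b",
            "4,2023-01-01,07:20:24,1,8,Lower Manhattan,22,3.0,Coffee,a,b");
        // A threshold of 2 is used here instead of 20000 so the tiny sample produces output.
        System.out.println(run(rows, 2));  // prints {Astoria=8.5}
    }
}
```

In a Hadoop job the grouping done inside run is performed by the framework between the mapper and the reducer, so only the logic of map and reduce would carry over.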
Start from the provided template SQL2MR.java and fill in the code in the
marked areas.
Reminder of the steps for compiling and running the program:
hadoop com.sun.tools.javac.Main SQL2MR.java
jar cf sql2mr.jar SQL2MR*.class
hadoop jar sql2mr.jar SQL2MR input output
Note that you need to remove the first row (the header row) from the provided CSV file
and store the resulting file under the input directory.
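One possible way to do this step is sketched below as a small Java utility. The class name StripHeader and the method stripHeader are hypothetical, and the sketch assumes the file is UTF-8 text, is small enough to read into memory, and that the input directory is a local directory named input next to the CSV file. Any other method of dropping the first line works equally well.

```java
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.List;

class StripHeader {
    // Copies source to target without its first line (the header row).
    static void stripHeader(Path source, Path target) throws IOException {
        List<String> lines = Files.readAllLines(source, StandardCharsets.UTF_8);
        Path parent = target.getParent();
        if (parent != null) {
            Files.createDirectories(parent);  // create the input directory if it is missing
        }
        // Skip line 0; an empty source file produces an empty target file.
        List<String> body = lines.subList(Math.min(1, lines.size()), lines.size());
        Files.write(target, body, StandardCharsets.UTF_8);
    }

    public static void main(String[] args) throws IOException {
        // Assumed locations: the CSV in the current directory, output under ./input.
        stripHeader(Paths.get("coffee_shop_sales_2023.csv"),
                    Paths.get("input", "coffee_shop_sales_2023.csv"));
    }
}
```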
Submission Instructions:
- Submit three files named:
  o SQL2MR.java
  o sql2mr.jar
  o part-r-00000
- Do not zip the files.