Business Case
Consider a bike-sharing system. The state of all bike stations is recorded every second, and each recorded state is stored as a NoSQL document (please see the attachment).
The dataset of all rides (times are given in seconds) is provided as a SQL table with the following fields: pick_up_time, drop_off_time, user_id, start_station_id, end_station_id.
Directions
Write pseudocode describing an efficient MapReduce algorithm (mapper and reducer) that merges these two datasets into a single relational dataset. The output should contain all of the original fields from the rides dataset, plus the number of bikes available at the pick-up station when the bike was taken and the number of parking spots available at the drop-off station when the bike was returned.
Hint: Identify a data manipulation technique addressed in the textbook, and customize its description to fit this case.
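One possible reading of the task is a reduce-side join keyed on (station_id, timestamp). Whether this is the technique the textbook has in mind is not stated here. The sketch below simulates that idea in memory with plain Python, so it is an illustration under assumptions, not a definitive solution. The attachment is not reproduced in this text, so the document shape (a "timestamp" plus a "stations" list with "station_id", "bikes_available" and "docks_available") is assumed, as are all function names. It also assumes that the snapshot taken at the exact second of pick-up or drop-off is the one wanted.

```python
from collections import defaultdict
from itertools import chain

RIDE_FIELDS = ("pick_up_time", "drop_off_time", "user_id",
               "start_station_id", "end_station_id")

def ride_key(ride):
    # Hypothetical ride identifier: the tuple of all original fields.
    return tuple(ride[f] for f in RIDE_FIELDS)

def map_station(doc):
    # ASSUMED document shape; the real attachment may differ.
    for s in doc["stations"]:
        yield ((s["station_id"], doc["timestamp"]),
               ("state", s["bikes_available"], s["docks_available"]))

def map_ride(ride):
    # Each ride is emitted twice: once per station/time it must be joined on.
    yield (ride["start_station_id"], ride["pick_up_time"]), ("pickup", ride)
    yield (ride["end_station_id"], ride["drop_off_time"]), ("dropoff", ride)

def reduce_join(key, values):
    # All values share one (station_id, timestamp); attach the state to rides.
    state = next((v for v in values if v[0] == "state"), None)
    bikes, docks = (state[1], state[2]) if state else (None, None)
    for v in values:
        if v[0] == "pickup":
            yield ride_key(v[1]), ("bikes_at_pickup", bikes, v[1])
        elif v[0] == "dropoff":
            yield ride_key(v[1]), ("docks_at_dropoff", docks, v[1])

def reduce_merge(key, values):
    # Second stage: combine the pick-up and drop-off halves of one ride.
    row = dict(values[0][2])
    for name, count, _ in values:
        row[name] = count
    yield row

def shuffle(pairs):
    # In-memory stand-in for the framework's group-by-key step.
    groups = defaultdict(list)
    for k, v in pairs:
        groups[k].append(v)
    return groups

def enrich_rides(rides, station_docs):
    mapped = chain(
        chain.from_iterable(map_station(d) for d in station_docs),
        chain.from_iterable(map_ride(r) for r in rides),
    )
    stage1 = shuffle(mapped)
    partials = [kv for k, vs in stage1.items() for kv in reduce_join(k, vs)]
    stage2 = shuffle(partials)
    return [row for k, vs in stage2.items() for row in reduce_merge(k, vs)]
```

The second stage is needed because the pick-up and drop-off lookups for a ride land on different keys in the first stage. In a real cluster, a mapper could drop station states for (station, second) pairs that no ride refers to, which would require knowing those pairs in advance. That filtering is omitted here.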