PLEASE CHECK THE FILE ATTACHED AND SEE IF YOU CAN DO IT ?
NOTE: TO BE DONE USING R STUDIO, PLEASE READ EVERYTHING BELOW CAREFULLY
Would you be able to do this project:
(((
Your final task is to prepare a short report documenting your analysis (no more than 4 pages, not including cover or appendices). Include your R code in the Appendix. Prepare a short Powerpoint show to present to your peers during final exam week. This presentation should not exceed 8 minutes.
The project:
Freeway Data and Weather
Research Question:
How does weather affect the capacity of a selected segment of OR-217?
Background
The schema loop in the class database contains data obtained from inductive loop sensors on OR-217 SB at 5-minute aggregations for calendar year 2009. The loop sensors collect volume, speed, occupancy and metrics on vehicle-miles-traveled, vehicle-hours-traveled, delay and information about bad detector readings in table loop.loopdata_5min_217sb_2009. The loop schema also contains tables highways, detectors, ramps, and stations. Detectors are related to ramps and stations. Begin by processing some simple data quality metrics to identify a segment that you wish to consider in more detail. Ideally, this segment would have high-quality data. Contour plots of data quality would be an excellent tool to explore this quickly. Next, the table weather.metar_2009 contains hourly precipitation data. You will need to define “capacity” in a repeatable manner for a selected time period, then process the data to generate these measures. R’s ability to process large amounts of data can be a limiting if you don’t plan properly. One suggestion would be to write SQL code that produces average hourly volumes and speeds that you could join to the weather data. Assign weather information to these observations, then graphically explore the differences in speed-flow or other curves for each weather category. Statistically quantify these results. Consider t-test of the means or ANOVA analysis
))
below are the csv tables for the data.
This is the data:
https://www.dropbox.com/sh/q4esue8k3enbv1b/InbRQU0zbu
please make sure there are 6 csv files. as you might know you can use the dropbox links in R studio as:
dataFrameName <- read.csv("DropBoxLinkHere.com/file.csv", header=TRUE, sep=",", na.strings="NA", dec=".", strip.white=TRUE)
to load the dataFrame you want into R studio.
please let me know if you can help doing this project which is the code + no more 4 pages writing paper.