In this project we experimented with apache spark queries on big data datasets like the movielens dataset ("https://grouplens.org/datasets/movielens/") and tried to optimise their perfomance both on local cluster scenarios and at cloud/server scenarios like the livy server("https://livy.apache.org/").
-
Notifications
You must be signed in to change notification settings - Fork 0
In this project we make queries that query big data like the movielens dataset that is used here,then we run these experiments to compare perfomance at local machine clusters and at livy server clusters.
License
OperaDevelop07/Big-Data-Querying-with-Apache-Spark
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
In this project we make queries that query big data like the movielens dataset that is used here,then we run these experiments to compare perfomance at local machine clusters and at livy server clusters.
Topics
Resources
License
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published