MovieAnalysis

The following links are helpful for the project,

The files dataset01.csv and dataset02.csv consist of 27000 entries.

For the project, we filtered the dataset to the years 1990-2014, country USA, and language English, which leaves 10060 entries.
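The filter described above can be sketched roughly as follows. This is a minimal illustration using only the standard library, not the code in filteringDataset.ipynb: the column names (`id`, `year`, `country`, `language`), the `filter_rows` helper, and the sample rows are all assumptions, and the real CSV headers may differ. Repeated IDs are dropped as well, since the filtering step also removes duplicates.

```python
import csv
import io


def filter_rows(rows, first_year=1990, last_year=2014,
                country="USA", language="English"):
    """Keep rows inside the year range with the given country and language,
    dropping any row whose ID has already been seen."""
    seen_ids = set()
    kept = []
    for row in rows:
        try:
            year = int(row["year"])
        except (KeyError, ValueError):
            continue  # skip rows with a missing or non-numeric year
        if not (first_year <= year <= last_year):
            continue
        if row.get("country") != country or row.get("language") != language:
            continue
        movie_id = row.get("id")
        if movie_id in seen_ids:
            continue  # duplicate ID, keep only the first occurrence
        seen_ids.add(movie_id)
        kept.append(row)
    return kept


# Made-up sample data with hypothetical column names.
sample = io.StringIO(
    "id,title,year,country,language\n"
    "tt1,A,1995,USA,English\n"
    "tt1,A,1995,USA,English\n"
    "tt2,B,1985,USA,English\n"
    "tt3,C,2001,France,French\n"
    "tt4,D,2014,USA,English\n"
)
filtered = filter_rows(csv.DictReader(sample))
filtered_ids = [row["id"] for row in filtered]
print(filtered_ids)
```

In this sample, the duplicate `tt1` row, the 1985 row, and the non-USA row are dropped.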

Project Implementation Steps:

Run filteringDataset.ipynb to filter the dataset and remove duplicate IDs. This produces datasetWithoutBoxOffice.csv.
Run extractBoxOffice.ipynb to extract the box office figures using the WebCrawl class in webcrawl.py. This produces datasetWithBoxOffice.csv.

Optional (but suggested): we made 10 copies of extractBoxOffice.ipynb, each handling 1000 entries, and then merged the resulting CSVs with mergeCSV.ipynb to get datasetWithBoxOffice.csv.

Alternatively, you can run extractBoxOfficeAllEntries.ipynb to extract the box office for all entries in one run, but this takes a long time (hours).

Run extractTicketInflationPrice.ipynb to extract the table of ticket price inflation by year. This produces ticketPriceInflation.csv.
Run adjustTicketPriceInflation.ipynb. This produces finalDataset.csv.
Run plotDataset1.ipynb and plotDataset2.ipynb to visualise the dataset.
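The README does not spell out how the ticket price adjustment is computed. One common approach, shown here only as an assumption about what adjustTicketPriceInflation.ipynb might do, is to estimate tickets sold (gross divided by that year's average ticket price) and multiply by the price in a reference year. The column names, the `adjust_for_ticket_inflation` helper, and all numbers below are hypothetical and purely illustrative.

```python
import csv
import io

# Made-up values and hypothetical headers; the real ticketPriceInflation.csv
# and datasetWithBoxOffice.csv layouts may differ.
ticket_prices_csv = io.StringIO(
    "year,avg_ticket_price\n1990,4.00\n2000,5.00\n2014,8.00\n"
)
movies_csv = io.StringIO(
    "id,year,box_office\ntt1,1990,1000000\ntt2,2000,2000000\ntt3,2014,3000000\n"
)


def adjust_for_ticket_inflation(movies, price_by_year, reference_year):
    """Rescale each gross to reference-year ticket prices."""
    reference_price = price_by_year[reference_year]
    adjusted = []
    for movie in movies:
        year = int(movie["year"])
        gross = float(movie["box_office"])
        # Estimated tickets sold in the release year.
        estimated_tickets = gross / price_by_year[year]
        adjusted.append({
            **movie,
            "adjusted_box_office": round(estimated_tickets * reference_price, 2),
        })
    return adjusted


price_by_year = {
    int(row["year"]): float(row["avg_ticket_price"])
    for row in csv.DictReader(ticket_prices_csv)
}
adjusted_movies = adjust_for_ticket_inflation(
    csv.DictReader(movies_csv), price_by_year, reference_year=2014
)
for movie in adjusted_movies:
    print(movie["id"], movie["adjusted_box_office"])
```

With these sample numbers, a 1990 gross of 1000000 at a ticket price of 4.00 corresponds to 250000 tickets, which is 2000000 at the 2014 price of 8.00.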

On Windows, use UTF-8 encoding when converting to CSV.
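The chunk-and-merge workflow and the UTF-8 note can be illustrated together. The sketch below is not the contents of mergeCSV.ipynb: the `write_csv` and `merge_csvs` helpers, the file names, and the sample rows are hypothetical. It passes `encoding="utf-8"` explicitly on every read and write, since the default text encoding on Windows is often not UTF-8.

```python
import csv
import tempfile
from pathlib import Path


def write_csv(path, rows, fieldnames):
    """Write rows as UTF-8 CSV. newline="" is what the csv module recommends
    and avoids blank lines between rows on Windows."""
    with open(path, "w", newline="", encoding="utf-8") as f:
        writer = csv.DictWriter(f, fieldnames=fieldnames)
        writer.writeheader()
        writer.writerows(rows)


def merge_csvs(chunk_paths, merged_path):
    """Concatenate chunk CSVs that share the same header into one file."""
    merged = []
    fieldnames = None
    for path in chunk_paths:
        with open(path, newline="", encoding="utf-8") as f:
            reader = csv.DictReader(f)
            if fieldnames is None:
                fieldnames = reader.fieldnames
            merged.extend(reader)
    write_csv(merged_path, merged, fieldnames)
    return len(merged)


with tempfile.TemporaryDirectory() as tmp_dir:
    tmp = Path(tmp_dir)
    # Made-up rows with a non-ASCII character to exercise the encoding.
    rows = [{"id": f"tt{i}", "title": f"Café {i}"} for i in range(5)]
    chunk_size = 2  # the README uses 1000 entries per notebook copy
    chunk_paths = []
    for n, start in enumerate(range(0, len(rows), chunk_size)):
        path = tmp / f"chunk{n}.csv"
        write_csv(path, rows[start:start + chunk_size], ["id", "title"])
        chunk_paths.append(path)

    merged_count = merge_csvs(chunk_paths, tmp / "merged.csv")
    with open(tmp / "merged.csv", newline="", encoding="utf-8") as f:
        merged_titles = [row["title"] for row in csv.DictReader(f)]

print(merged_count, merged_titles)
```

Splitting the work into chunks means a failed crawl only costs one chunk's worth of requests, which is presumably why the README suggests it.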

Images

Snapshot of the final dataset

One of the plots of the dataset

ShivakumarSwamy/MovieAnalysis
