Statistics

Project description
Section A – Land Registry Website task:
www.gov.uk/government/organisations/land-registry
This website gives access to large downloadable datasets about the most recent house purchases. There are always concerns about issues including data protection, privacy, and the costs of providing data.
A1 The UK Government, through its Open Data Initiative, freely releases large amounts of data, including the information on the Land Registry website. In up to 100 words, consider the possible costs and benefits of the free availability of data such as that contained on the Land Registry site.
Hint: you may find the following link useful when preparing your answer: http://data.gov.uk/blog/what-did-open-data-ever-do-us
NOTE: DO NOT COPY AND PASTE DIRECTLY FROM THE WEBSITE FOR YOUR ANSWER. YOU WILL GET NO MARKS.
A2 Price Paid Data is residential property price data for all the residential property sales in England and Wales that are lodged for registration in that month. The following information is available for each property:
the full address, the price paid, the date of transfer, the property type, etc.
From the Land Registry page, go to the Price Paid Data website:
https://www.gov.uk/government/statistical-data-sets/price-paid-data-downloads
and locate the Price Paid Data for the current month.
This can be downloaded in CSV format and in a machine-readable format as linked data.
The files below include the transactions received at Land Registry in the period from the first to the last day of the previous month.
Download the current month's records in .csv format.
Open the .csv file in Excel, and type in titles for the columns A to O inclusive, as follows, and then delete the blank columns to make the data set ready for analysis:
B: Price (£)
D: Postcode
H: Number
J: Street
K: District
L: Town
N: County/City
Highlight the column (County/City) and then press the Filter button. Untick the Select All box. Then tick to see only the Lincolnshire records.
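As an optional cross-check on the Excel filter, the column titling and the Lincolnshire filter can be sketched in Python. This is a minimal sketch and not part of the required Excel workflow. It assumes the downloaded file has no header row and that the columns sit in the positions given by the letters in the brief. The function names and the file name in the usage comment are hypothetical.

```python
import csv

# Column letters and titles as listed in the brief.
COLUMN_TITLES = {
    "B": "Price",
    "D": "Postcode",
    "H": "Number",
    "J": "Street",
    "K": "District",
    "L": "Town",
    "N": "County/City",
}


def column_index(letter):
    """Convert a single spreadsheet column letter (A-Z) to a 0-based index."""
    return ord(letter.upper()) - ord("A")


def read_titled_rows(lines):
    """Yield one dict per CSV row, keeping only the titled columns.

    `lines` can be an open text file or any iterable of CSV lines.
    Rows too short to contain column N are skipped.
    """
    last_needed = max(column_index(letter) for letter in COLUMN_TITLES)
    for row in csv.reader(lines):
        if len(row) <= last_needed:
            continue
        yield {
            title: row[column_index(letter)].strip()
            for letter, title in COLUMN_TITLES.items()
        }


def filter_county(rows, county="Lincolnshire"):
    """Keep rows whose County/City matches `county`, ignoring letter case."""
    wanted = county.casefold()
    return [r for r in rows if r["County/City"].casefold() == wanted]


# Usage (the file name is a placeholder for whatever you downloaded):
# with open("price-paid-current-month.csv", newline="", encoding="utf-8") as f:
#     lincolnshire = filter_county(read_titled_rows(f))
```

The comparison ignores letter case because the county may be stored in capitals in the file. Compare the number of rows returned with the number Excel shows after filtering.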
It is best to copy the Lincolnshire records to a new worksheet in Excel and then use Data/Data Analysis to analyse the prices in the current month, producing Excel outputs for the following:
a histogram of the price paid data for Lincolnshire
a statement of the averages for the data used
a statement of the measures of spread for the data used.
Finally, report on all your findings and their significance in about 100-125 words.
YOU MUST INDICATE THE SPECIFIC MONTH USED FOR YOUR ANALYSIS IN ORDER TO FULLY COMPLETE THIS SECTION OF THE ASSIGNMENT.
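The averages, measures of spread and histogram counts requested in A2 can be cross-checked outside Excel. The following is a minimal sketch using Python's standard `statistics` module, assuming the Lincolnshire prices are already held as a list of numbers. The function names and the choice of bin width are assumptions, not part of the brief. Variance and standard deviation here are the sample versions (dividing by n - 1).

```python
import statistics
from collections import Counter


def averages(prices):
    """Return the mean, median and mode(s) of the prices."""
    return {
        "mean": statistics.mean(prices),
        "median": statistics.median(prices),
        "modes": statistics.multimode(prices),  # list, in case of ties
    }


def spread(prices):
    """Return the range, interquartile range, sample variance and sample SD.

    Needs at least two prices.
    """
    q1, _median, q3 = statistics.quantiles(prices, n=4, method="inclusive")
    return {
        "range": max(prices) - min(prices),
        "iqr": q3 - q1,
        "variance": statistics.variance(prices),
        "std_dev": statistics.stdev(prices),
    }


def histogram_counts(prices, bin_width):
    """Count prices in bins [start, start + bin_width), keyed by bin start."""
    counts = Counter((p // bin_width) * bin_width for p in prices)
    return dict(sorted(counts.items()))
```

Each bin here includes its lower edge and excludes its upper edge, so counts may differ from Excel's Histogram tool for prices that fall exactly on a bin boundary. A large gap between the mean and the median is worth commenting on in the report, since it suggests a skewed price distribution.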
A3 Generate tables of the five lowest and five highest prices in Lincolnshire. Ensure you also identify their postcodes and towns.
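The two tables for A3 amount to sorting the Lincolnshire records by price and taking the five from each end. The following is a minimal sketch, assuming each record is a dictionary with a numeric "Price" and text "Postcode" and "Town" entries (titles taken from the brief). The function names are hypothetical.

```python
import heapq


def price_extremes(records, n=5):
    """Return (lowest, highest): the n cheapest and n dearest records.

    `lowest` is ordered from cheapest upwards, `highest` from dearest downwards.
    """
    def price(record):
        return record["Price"]

    lowest = heapq.nsmallest(n, records, key=price)
    highest = heapq.nlargest(n, records, key=price)
    return lowest, highest


def format_table(rows):
    """Lay out price, postcode and town as fixed-width text columns."""
    lines = [f"{'Price':>12}  {'Postcode':<10}  Town"]
    for r in rows:
        lines.append(f"{r['Price']:>12,}  {r['Postcode']:<10}  {r['Town']}")
    return "\n".join(lines)
```

In Excel the same result comes from sorting the Price column in ascending and then descending order and copying the first five rows each time.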
We publish the following attribution statement as we use or publish the price paid data:
Data produced by Land Registry © Crown copyright 2015.
If you publish the data you should include information about the nature of the data and any relevant dates for the period of time covered. For example:
This data covers the transactions received at Land Registry in the period [first working day of the month] to [last working day of the month]. © Crown copyright 2015.
If you have found an error with the data please contact Her Majesty’s Land Registry (HMLR).
SECTION B (Film Distributors Association website www.launchingfilms.com/)
CONTEXT
You are to analyse some features of the current and historical UK cinema industry based on data from the website, and compile some regression analyses based on time-series data.
B1 Access the Research & Databank area and then Facts and Figures. Access the UK monthly film admissions data.
Create one time-series chart of the monthly UK cinema admissions for the 3-year period 2011-2014 inclusive.
Comment on the findings from the graph, making comparisons between the years and identify any seasonality in the Admission figures. (100-150 words).
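One way to support a comment on seasonality is to compare each calendar month's average admissions with the overall average. The following is a minimal sketch of that idea, assuming the monthly admissions have been typed into a dictionary keyed by (year, month). The function name and data layout are hypothetical, and no admissions figures are supplied here.

```python
from collections import defaultdict
from statistics import mean


def seasonal_indices(admissions):
    """Return {month: month average / overall average}.

    `admissions` maps (year, month) to that month's admissions figure.
    An index above 1 marks a month that is busier than average across
    the years supplied; below 1 marks a quieter month.
    """
    by_month = defaultdict(list)
    for (_year, month), value in admissions.items():
        by_month[month].append(value)
    overall = mean(list(admissions.values()))
    return {
        month: mean(values) / overall
        for month, values in sorted(by_month.items())
    }
```

Months whose index stays well above or below 1 in every year are candidates for a seasonal pattern. The time-series chart itself should still be produced in Excel as the brief requires.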
B2 Access the current top 15 films in UK cinemas shown on the webpage (at the point in time when you access the webpage's data). This can be directly copied into Excel.
Form a table of results in your report.
Produce a scatter graph to illustrate the Weekend total (£) plotted against the Weeks on release.
Fit a regression model of your choice to the data.
Carefully explain what all the regression outputs signify for this dataset.
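If a straight-line model is chosen, the main regression outputs can be reproduced by hand as a check on Excel. The following is a minimal sketch of ordinary least squares for one explanatory variable, assuming x holds Weeks on release and y holds Weekend total. The function name is hypothetical, and a straight line is only one possible choice of model.

```python
from statistics import mean


def simple_linear_regression(x, y):
    """Fit y = intercept + slope * x by ordinary least squares.

    Returns the slope, the intercept and R squared.
    """
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("x and y must have the same length, at least 2")
    x_bar, y_bar = mean(x), mean(y)
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    syy = sum((yi - y_bar) ** 2 for yi in y)
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, y))
    if sxx == 0:
        raise ValueError("all x values are identical; slope is undefined")
    slope = sxy / sxx
    intercept = y_bar - slope * x_bar
    # With no variation in y, the fitted horizontal line matches every point.
    r_squared = (sxy ** 2) / (sxx * syy) if syy else 1.0
    return {"slope": slope, "intercept": intercept, "r_squared": r_squared}
```

For this dataset the slope estimates the change in Weekend total for each additional week on release, the intercept is the fitted Weekend total at zero weeks on release, and R squared is the proportion of the variation in Weekend total accounted for by the fitted line.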
YOU MUST INDICATE THE SPECIFIC WEEKEND USED FOR YOUR ANALYSIS IN ORDER TO FULLY COMPLETE THIS SECTION OF THE ASSIGNMENT.
Comment on your findings from both graphs (100-150 Words).
Hint: for examples of how to create the charts and perform a regression analysis please refer to the examples and recipe cards in the course textbook (in particular Chapter 4) that we have considered in detail during the Module Seminars.
Indicative breakdown of marks
Section A
A1 10%
A2 30%
A3 10%
Section A total 50%
Section B
B1 15%
B2 20%
Section B total 35%
Presentation and effective communication 15%
Total 100%
Report style/general presentation and quality of communication 15%
Pay attention to page numbers, diagrams, histograms, tables etc., which must be correctly labelled in academic format, e.g. “Table 3”, “Figure 5”. They must be cross-referenced in the main part of the report.
The use of an Introduction to the work is encouraged.
Sub-headings within each section are essential. Paraphrase sub-headings from the questions; do NOT merely repeat the text of the question.
You will not gain marks if you make little attempt to check spelling, punctuation and grammar. There is no excuse for incorrect spelling when you have a word-processor.
HINTS
Always show any formulae used and any methodology (working out).
You may use any calculator or computer. If you supply an answer that is incorrect and do not show your method, you will be penalised.
A report style is preferred to a narrative approach. The length of report should be about 8-10 sides of A4. Do not use appendices.
It is essential to word-process this assignment.
Spreadsheet printouts should be incorporated using Cut and Paste.
If you have problems getting access to Excel, inform your tutor.
DO NOT LEAVE THIS ASSIGNMENT UNTIL THE LAST MINUTE. YOU MAY FIND THAT IT TAKES LONGER THAN YOU THINK! START NOW!!!
Refer to the Scheme of Work (on Blackboard) for details of how the Learning Outcomes for the module are assessed in this assignment.
Excel 2003 (on some University Computers) and 2010 (the Library's Cloud)
When using Excel you may need to check that it has got Data Analysis. If not, click on Tools/Add-Ins and then select Analysis ToolPak which will bring in the statistical analysis functions. Use the Classwork & Homework file on Blackboard t
