The study of variability in the various factors is very important in understanding the nature and trends involved in the variability. The variability pattern exists in the rainfall is responsible for the proportion of the rainfall in the different regions or areas. We know that the proportion of the rainfall is not same for different regions. There is low proportion of rainfall in some areas, high proportion of rainfall in some areas, however there is tremendous rainfall for particular areas. The amount of rainfall is measured in millimeter. There are so many factors responsible for the variability in the rainfall in different areas. Some possible factors may be climatic change, temperature, velocity of wind, etc. Actually, prediction of rainfall is very hard due to involvement of several factors or variables which are responsible for rainfall. Now a day, scientists make more advanced regression model for the purpose of rainfall prediction. Many factors are taken into account for prediction of rainfall. Prediction models are improved due to use of technology. Rainfall is very important for farming, drinking water, industries, etc. Water is called as life! So, importance of rainfall in human life and entire bio-diversity is very high. Life on the earth is not possible without water. Rainfall consists of all forms of the water particles and it may be in state of liquid or solid. For example, rainfall may be in state of rain, drizzle, hail or snow. The rainfall falls form the clouds in any form and it reaches to ground. Each country keeps the records of the rainfall. Most of the countries have their own independent departments for weather prediction and meteorological activities. In most of the countries in the world, the measurement of the rainfall is observed at the 9 am daily according to the local time. For the measurement of the rainfall, a standard instrument is used and this instrument is called as rain gauge. The precipitation is most commonly known as the rain and it is recorded with the amount of precipitation. The data for the rainfall is collected from different stations and data is collected at the Bureau for further analysis.
Data Collection
For checking different research questions or hypotheses established for the research study, the first step in any research study is to collect the data and check the hypotheses by using proper statistical methods. For this research study, the data is collected from the authorized website for the rainfall data records. The rainfall data is collected at 9 am of the local time at each day. The rainfall data is nothing but the record of previous 24 hours. For this research study, the data is collected from the sixth month of the year 2004 up to second month of the year 2017. Data is collected for the Ballarat Hopetown Rd station number 89111. Data is collected for the monthly precipitation total in millimetres. Also, the quality of the rain is recorded. The collected climate data pass through a number of stages in quality control over the specific period of time. Also, there would be problem of gaps and missing data as every station does not have a complete unbroken record of the rainfall and climate information. Sometimes the stations have automatic system for the data collection and there would be possibility of failing the automatic system which may produce the biased observations. These types of errors or mistakes during data collection are responsible for the wrong prediction of the rainfall. Also, this type of data is not useful for the further use or statistical analysis.
Research Methodology
In this topic, we have to see the methodology used for the given research study of rainfall data. It is important to use the proper research methodology for getting the unbiased results. For this research study we have to study the variability and trends in the rainfall data for the Australian station. The data collection method in detail is described in above topic. The observations of daily rainfall are normally made at the 9 am of the local time and it is record of the previous 24 hours. The rainfall includes all forms of the precipitation that reach the ground such as rain, drizzle, hail and snow. For this research study, the data is collected from the sixth month of the year 2004 up to second month of the year 2017. Data is collected for the Ballarat Hopetown Rd station number 89111. Data is collected for the monthly precipitation total in millimetres. Also, the quality of the rain is recorded. After collection of data, we have to use this data for statistical analysis. We have to perform the statistical analysis by using the time series tools and techniques. We have to study the trend analysis and time series analysis for the given data for rainfall for the particular station in the Australia.
Statistical Analysis
For the statistical analysis of the rainfall data, we have to use the different tools and techniques of statistical analysis. We have to use the descriptive statistics, graphical analysis, and inferential statistics for this research study. We have to use the results from the IBM Watson and other computational software’s. Let us see this statistical analysis in detail.
First we have to see the descriptive statistics for the monthly precipitation total in millimetres for the given data. The required descriptive statistics are summarised as below:
Descriptive Statistics |
|||||
N |
Minimum |
Maximum |
Mean |
Std. Deviation |
|
Monthly Precipitation Total (millimeters) |
152 |
1.00 |
209.60 |
54.40 |
31.10 |
Valid N (listwise) |
152 |
The average monthly precipitation total is given as 54.4 with the standard deviation of 31.1. The minimum monthly precipitation total is recorded as the 1 millimetre while the maximum monthly precipitation is recorded as the 209.60 millimetres. The data is collected for the 152 observations or months.
Now, we have to see the box plot for the data for monthly precipitation total. The required box plot is given as below:
The above box plot shows that there is existence of outliers and data is skewed at right.
The histogram for the rainfall data is given as below:
Above histogram shows the right skewed nature of the rainfall data.
Now, we have to see the sequence plot or time series analysis for the monthly precipitation total.
The sequence plot is given as below:
From the above diagram, it is observed that there is a presence of the trends. Also, continuously up and down movement of the rainfall based on the years is observed for the rainfall data.
Now, we have to see the sequence plot for the monthly precipitation total in millimetres. The required plot is given as below:
It is observed that the data for the monthly precipitation total in millimetres have the seasonal variations and trends as per different months.
The IBM Watson screenshot for the trend analysis of monthly precipitation total is given as below:
The above trend line shows the variation in the values of precipitation and there is no any specific pattern found with this trend line.
The area under curve for the trend line is given as below:
The comparison for the each year for the monthly precipitation total is summarised as below:
The above bar graph shows that there is no any specific distribution is followed by the precipitation values.
The study of autocorrelation is very useful in the prediction of rainfall. The autocorrelations between the time series data points with different lags are summarized in the following autocorrelation model.
Autocorrelations |
|||||
Series:Monthly Precipitation Total (millimetres) |
|||||
Lag |
Autocorrelation |
Std. Errora |
Box-Ljung Statistic |
||
Value |
df |
Sig.b |
|||
1 |
.222 |
.080 |
7.656 |
1 |
.006 |
2 |
.110 |
.080 |
9.531 |
2 |
.009 |
3 |
.099 |
.080 |
11.086 |
3 |
.011 |
4 |
.075 |
.080 |
11.980 |
4 |
.017 |
5 |
-.127 |
.079 |
14.558 |
5 |
.012 |
6 |
-.064 |
.079 |
15.222 |
6 |
.019 |
7 |
-.209 |
.079 |
22.253 |
7 |
.002 |
8 |
-.063 |
.078 |
22.893 |
8 |
.004 |
9 |
-.012 |
.078 |
22.916 |
9 |
.006 |
10 |
.082 |
.078 |
24.019 |
10 |
.008 |
11 |
.059 |
.078 |
24.599 |
11 |
.010 |
12 |
.153 |
.077 |
28.510 |
12 |
.005 |
13 |
.078 |
.077 |
29.528 |
13 |
.006 |
14 |
-.015 |
.077 |
29.569 |
14 |
.009 |
15 |
-.024 |
.077 |
29.669 |
15 |
.013 |
16 |
.042 |
.076 |
29.972 |
16 |
.018 |
a. The underlying process assumed is independence (white noise). |
|||||
b. Based on the asymptotic chi-square approximation. |
From the given autocorrelations with different lag points it is observed that about all autocorrelations are statistically significant at 5% level of significance as the corresponding p-values are less than alpha value 0.05.
Conclusions
References
Antony, J. (2003). Design of Experiments for Engineers and Scientists. Butterworth Limited.
Babbie, E. R. (2009). The Practice of Social Research. Wadsworth.
Beran, R. (2000). React scatterplot smoothers: Superefficiency through basis economy. Journal of the American Statistical Association.
Bickel, P. J. and Doksum, K. A. (2000). Mathematical Statistics: Basic Ideas and Selected Topics, Vol I. Prentice Hall.
Casella, G. and Berger, R. L. (2002). Statistical Inference. Duxbury Press.
Cox, D. R. and Hinkley, D. V. (2000). Theoretical Statistics. Chapman and Hall Ltd.
Degroot, M. and Schervish, M. (2002). Probability and Statistics. Addison – Wesley.
Dobson, A. J. (2001). An introduction to generalized linear models. Chapman and Hall Ltd.
Evans, M. (2004). Probability and Statistics: The Science of Uncertainty. Freeman and Company.
Hastle, T., Tibshirani, R. and Friedman, J. H. (2001). The elements of statistical learning: data mining, inference, and prediction: with 200 full-color illustrations. Springer – Verlag Inc.
Hogg, R., Craig, A., and McKean, J. (2004). An Introduction to Mathematical Statistics. Prentice Hall.
Liese, F. and Miescke, K. (2008). Statistical Decision Theory: Estimation, Testing, and Selection. Springer.
Pearl, J. (2000). Casuality: models, reasoning, and inference. Cambridge University Press.
Ross, S. (2014). Introduction to Probability and Statistics for Engineers and Scientists. London: Academic Press.
Todd, G. (2007). Descriptive Statistics. Topics in Biostatistics. New York: Springer.
Huck, S.W. (2007) Reading statistics and research. 5th ed., Allyn & Bacon: Boston, MA
Pallant, J. (2007) SPSS survival manual, 3rd ed., Allen & Unwin.
De Vaus, D. (2002) Analyzing social science data. SAGE Publications: London.
Wild, C.J., & Seber, G.A.F. (2000) Chance encounters: a first course in data analysis and inference. John Wiley & Sons: New York
Essay Writing Service Features
Our Experience
No matter how complex your assignment is, we can find the right professional for your specific task. Contact Essay is an essay writing company that hires only the smartest minds to help you with your projects. Our expertise allows us to provide students with high-quality academic writing, editing & proofreading services.Free Features
Free revision policy
$10Free bibliography & reference
$8Free title page
$8Free formatting
$8How Our Essay Writing Service Works
First, you will need to complete an order form. It's not difficult but, in case there is anything you find not to be clear, you may always call us so that we can guide you through it. On the order form, you will need to include some basic information concerning your order: subject, topic, number of pages, etc. We also encourage our clients to upload any relevant information or sources that will help.
Complete the order formOnce we have all the information and instructions that we need, we select the most suitable writer for your assignment. While everything seems to be clear, the writer, who has complete knowledge of the subject, may need clarification from you. It is at that point that you would receive a call or email from us.
Writer’s assignmentAs soon as the writer has finished, it will be delivered both to the website and to your email address so that you will not miss it. If your deadline is close at hand, we will place a call to you to make sure that you receive the paper on time.
Completing the order and download