“Yelp” is an online business that accepts and publishes reviews in the fields of local businesses and daily life incidents. Yelpers have written 71 million reviews to date. “Yelp” has become a very crucial site specially for small businesses that can achieve success or close down business as per online reviews.
The data analysis undertakes yelp reviews for restaurants and cafes; summarises the “Online Reviews” vis sentiment analysis. The data analysis is executed on the basis of Yelp_reviews.csv file that contains 1569264 rows and 12 columns. The file size including yelp_reviews.csv is 122.5 MB in size. The big data is analyses with the help of “R” software. The data analysis is both qualitative and quantitative simultaneously.
The average rating is satisfactory. (Mean = 3.743 and SD = 1.311468). Hence, overall the reviews are indicating satisfaction.
The number characters of reviews have range from 0 to 1047. The average number of characters of reviews is 126 with SD = 115.498.
The total number of positive words used in a review is 7.07 on an average with standard deviation 5.927. The total number of positive words used in a review has range 0 to 94.
The total number of negative words used in a review is 2.55 on an average with standard deviation 3.25. The total number of positive words used in a review has range 0 to 65.
Hence, positive words in the yelp review is far greater than negative words in the review.
The average net sentiment score is very low with mean = 4.522 and standard deviation = 4.522. Its range is (-59) to 80. The low net sentiment average is obviously a reason of concern.
Among the first twenty selected samples, the modal discrete value of positive words is 3 with the 4-time occurrences.
The distribution of first twenty cases is positively and right skewed.
Among the first twenty selected samples, the modal discrete value of negative words is 0 with the 8-time occurrences followed by 2 and 3 negative words with 4-time occurrences.
The distribution of first twenty cases is positively and highly right skewed.
Out of first 20 cases of net sentiment, only 3 values are positive and rest of the 17 values are negative. The values of the first 20 cases lies in the interval of (-4) to 12.
Out of first 20 cases, the value of net sentiment “3” occurred mostly for the 5 times followed by the value of net sentiment “5” occurred for the 3 times. Overall, the distribution is slightly negatively skewed.
The samples that have referred that-
The bar plot shows that average review length is highest for low ratings (stars 1 and 2). The average review length is decreasing as the rating improves afterwards 2.
Yes, on an average, positive reviews are lengthier than negative reviews (127>118).
Businesses with highest and lowest rating:
The lowest rating (stars 1) is maximum for the business “6LM_Klmp3hOP0JmsMCKRqQ” and minimum for the business “- -D12rW_xO8GuYBomlg9zw”.
Businesses with highest and lowest rating:
The rating (stars 2) is maximum for the business “Xhg93cMdemu5pAMkDoEdtQ” and minimum for the business “- -lemggGHgoG6ipd_RMb-g”.
Businesses with highest and lowest rating:
The moderate rating (stars 3) is maximum for the business “Xhg93cMdemu5pAMkDoEdtQ” and minimum for the business “- -4Pe8BZ6gj57VFL5mUE8g”.
Businesses with highest and lowest rating:
The high rating (stars 4) is maximum for the business “4bEjOyTaDG24SY5TxsaUNQ” and minimum for the business “- -4Pe8BZ6gj57VFL5mUE8g”.
Businesses with highest and lowest rating:
The highest rating (stars 5) is maximum for the business “2e2e7WgqU1BnpxmQL5jbfw” and minimum for the business “- -qeSYxyn62mMjWvznNTdg”.
The two variables ratings (stars) and number of useful voters (votes_useful) are uncorrelated (correlation coefficient = -0.04897). “votes_useful” is moderately and positively correlated with length of the reviews (correlation coefficient = 0.3258).
The linear regression model is executed assuming “votes_useful” as dependent variable and “stars” as well as “review_length” as independent variables. The p-values shows that review length has significant association with votes_useful but “stars” does not have linear significant association with votes_useful.
Conclusion:
The overall analysis depicts that positive reviews are more than negative reviews, that shows an optimistic attitude of the people towards “lodging and food” than pessimistic approach. The low ratings are found to be more reviewed than high ratings. However, average sentiment of the people towards the restaurants and cafes is not satisfactory. Usefulness of votes of consumers also has statistical significant association with review length. The negative ratings are more discussed than positive ratings in terms of review length. Although, positive feedbacks are slightly greater than negative feedbacks in almost all the cases discussed.
Essay Writing Service Features
Our Experience
No matter how complex your assignment is, we can find the right professional for your specific task. Contact Essay is an essay writing company that hires only the smartest minds to help you with your projects. Our expertise allows us to provide students with high-quality academic writing, editing & proofreading services.Free Features
Free revision policy
$10Free bibliography & reference
$8Free title page
$8Free formatting
$8How Our Essay Writing Service Works
First, you will need to complete an order form. It's not difficult but, in case there is anything you find not to be clear, you may always call us so that we can guide you through it. On the order form, you will need to include some basic information concerning your order: subject, topic, number of pages, etc. We also encourage our clients to upload any relevant information or sources that will help.
Complete the order formOnce we have all the information and instructions that we need, we select the most suitable writer for your assignment. While everything seems to be clear, the writer, who has complete knowledge of the subject, may need clarification from you. It is at that point that you would receive a call or email from us.
Writer’s assignmentAs soon as the writer has finished, it will be delivered both to the website and to your email address so that you will not miss it. If your deadline is close at hand, we will place a call to you to make sure that you receive the paper on time.
Completing the order and download