From 6bf41a3ca125c0cf9d7a1a5002968621176a7cd8 Mon Sep 17 00:00:00 2001 From: prathamkshah13 Date: Fri, 14 Apr 2023 05:23:54 +0530 Subject: [PATCH] updated with milestone6 task2 --- analysis/analysis2.ipynb | 41 ++++++++++++++++++++++++++++++++++++++++ project_vision.md | 2 +- 2 files changed, 42 insertions(+), 1 deletion(-) diff --git a/analysis/analysis2.ipynb b/analysis/analysis2.ipynb index 7b65073..6cfa722 100644 --- a/analysis/analysis2.ipynb +++ b/analysis/analysis2.ipynb @@ -2181,6 +2181,47 @@ "#### Most of the products with high ratings (above 3.5) are sold at a discount of around 10^1 to 10^2 percent.The reduced_category variable shows that the majority of the high-rated products sold at a discount belong to the categories 'Electronics','Computer and Accessories' and 'Home & Kitchen'. The categories 'Electronics' and 'Mobiles & Accessories' have a wider range of discount percentages for products with high ratings. The purple and orange dots represent the categories'Home & Kitchen' and 'Electronics', respectively, which have the most high-rated products sold at a discount." ] }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "### Conclusion" + ] + }, + { + "cell_type": "markdown", + "metadata": {}, + "source": [ + "#### Research question 1\n", + "##### Discount percentage vs rating of the products \n", + "###### Based on the analysis conducted(Hexbin plot for Discount price and Rating), we can conclude that the majority of the products in the dataset have ratings between 4 and 5, with discounted prices ranging between 100 to 140 for products with a rating of 4.2 and higher. The distribution of discounted percentages across all products indicates that most products have a discounted percentage of around 0.4-0.6% (We got this from the bar plot for Distributionof Frequency). Furthermore, the scatter plot for Discount Percentage vs Rating by Category reveals that high-rated products (above 3.5) are typically sold at discounts ranging from log (10^1 to 10^2) percent. \n", + "\n", + "#### Research question 2\n", + "##### Distribution of ratings across different product categories. For this I have created a scatter plot of Discount Percentage vs Rating by Category( This graph was also used for answering Discount percentage vs rating of the products). we know that Most of the products have high ratings (above 3.5). \n", + "\n", + "#### Research question 3\n", + "##### Number of ratingcount vary with the actual price of the products:\n", + "###### For this I created a scatter plot showing the relationship between the actual price of the products and the number of reviews (rating count). The plot shows that for lower actual prices, there are very few reviews (sparse points). As the actual price increases, the number of reviews increases, and the points become more closely packed. Between actual prices of 10^1 to 10^2(Logarithmic scale) , the points are densely packed indicating a high number of reviews for products with these prices. The y-axis shows the logarithmic scale of the number of reviews, and it indicates that most products have fewer than 10^3(Log scale) reviews, with points being more closely packed between 10^2 to 10^3(Log scale).\n", + "\n", + "#### Research question 4\n", + "##### Distinct Categories with Discount percentage: \n", + "###### The bar plot shows us the average discount percentage for each product category, while the violin plot shows the distribution of discount percentage within each category. From the bar plot, we can see that Office Products and Toys and games have the highest average discount percentage, while Home improvement has the lowest. This information can help us understand which categories tend to have more discounts and which ones tend to have less.The violin plot provides us with more information about the distribution of discount percentage within each category. For example, we can see that the distribution of discounts for Electronics is quite wide, with some products having very high discounts and others having very low discounts. On the other hand, the distribution for Toys and Games is relatively narrow, showing heavier discounts.Overall, these plots provide us with valuable insights into the discount patterns within each product category, which can be usefull for business optimization for their pricing strategies\n", + "\n", + "#### Research question 5\n", + "##### Rating vs Actual price:\n", + "###### I have created a Scatter plot with Regression line for depicting the trend between Rating and Actual price. From the plot we know that most products have rating between 3.5 and 4.5 and the price for moost products with ratingog 4.0 is ₹5000.I have also created two more plots for the frequency of actual price (Histogram) and count of Ratings (Box plot).Confirming the result we got from the scatter plot. \n", + "\n", + "\n", + "\n", + "\n", + "¶\n" + ] + }, { "cell_type": "code", "execution_count": null, diff --git a/project_vision.md b/project_vision.md index 6a04162..87a5aac 100644 --- a/project_vision.md +++ b/project_vision.md @@ -4,4 +4,4 @@ The aim of our project is to empower significant insights into Amazon sales thro ## Contracted Grade -For this project, we agree to contract for an A. +For this project, we agree to contract for a B.