I am trying to understand a density plot. It has narrow peaks and means are far away from the peaks. What would be the reason for such behavior? Should I be doing a different plot to understand this data other than a density plot?
Description of data: It's user performance and different colors represent different groups. You can think performance as step count and I have logged the performance.
ggplot(dt, aes(x=log(step_count), fill=groups)) + geom_density(alpha=.3) + geom_vline(data=mean.steps, aes(xintercept=mean.val, color=groups), linetype="dashed", size=.5)
Best Answer
One thing to note is that the means are far from the peaks for each of the groups except the blue one. All of your non-blue groups are skewed right (and thus not normally distributed) meaning the median performance of each group is less than the mean performance. What this means in the context of your data is that most users' performances are on the lower end while a few users have higher performances.
I'd say you have the right choice of visualization.
Similar Posts:
- Solved – Why does this distribution look completely different after splitting into groups
- Solved – Identifying clusters with OPTICS in R
- Solved – Plotting a frequency distribution with count and score (precision) for the frequenct
- Solved – Smooth a circular/periodic time series
- Solved – How to combine histograms with a density plot in R