Dear all,
I would appreciate having your advice/suggestions/comments on the following :
1 -- starting from a vector that contains the LENGTHS of DELETIONS (numerically, the values are from 1 to 10 000)
2 -- shall I display the ECDF by using the R code and some "limits" :
**BREAKS = c(0, 10, 20, 30, 40, 50, 60, 70, 80, 90, 100, 200, 300, 400, 500,
1000, 10000, 100000, 1000000, 10000000, 100000000, 1000000000)
ggplot(x, aes(LENGTH)) +
stat_ecdf(geom = "point") +
scale_x_continuous(name = "LENGTH of DEL",
breaks = BREAKS,
limits=c(0, 500))**
3 -- I am getting the following warning message : "Warning message: Removed 109 rows containing non-finite values (stat_ecdf)."
The question is : are these 109 values removed from VISUALIZATION as i set up the "limits", or are these 109 values removed from statistical CALCULATION?
4 -- in contrast, shall I use the standard R functions plot(ecdf), there is no "warning mesage"
**plot(ecdf(x$LENGTH), xlab="DEL LENGTH",
ylab="Fraction of DEL", main="DEL", xlim=c(0,500),
col = "dark red")**
Thanks a lot !
Try removing "limits" and then plot it. If it still throws the error, then your assumption is correct.