about ECDF in ggplot2
0
0
Entering edit mode
5.8 years ago
Bogdan ★ 1.4k

Dear all,

I would appreciate having your advice/suggestions/comments on the following :

1 -- starting from a vector that contains the LENGTHS of DELETIONS (numerically, the values are from 1 to 10 000)

2 -- shall I display the ECDF by using the R code and some "limits" :

**BREAKS = c(0, 10, 20, 30, 40, 50, 60, 70, 80, 90, 100, 200, 300, 400, 500,
           1000, 10000, 100000, 1000000, 10000000, 100000000, 1000000000)


ggplot(x, aes(LENGTH)) +
          stat_ecdf(geom = "point") +
          scale_x_continuous(name = "LENGTH of DEL",
                             breaks = BREAKS,
                             limits=c(0, 500))**

3 -- I am getting the following warning message : "Warning message: Removed 109 rows containing non-finite values (stat_ecdf)."

The question is : are these 109 values removed from VISUALIZATION as i set up the "limits", or are these 109 values removed from statistical CALCULATION?

4 -- in contrast, shall I use the standard R functions plot(ecdf), there is no "warning mesage"

**plot(ecdf(x$LENGTH), xlab="DEL LENGTH", 
                     ylab="Fraction of DEL", main="DEL", xlim=c(0,500),
                     col = "dark red")**

Thanks a lot !

ggplot2 • 2.0k views
ADD COMMENT
0
Entering edit mode
The question is : are these 109 values removed from VISUALIZATION as i set up the "limits", or are these 109 values removed from statistical CALCULATION?

Try removing "limits" and then plot it. If it still throws the error, then your assumption is correct.

ADD REPLY

Login before adding your answer.

Traffic: 1879 users visited in the last hour
Help About
FAQ
Access RSS
API
Stats

Use of this site constitutes acceptance of our User Agreement and Privacy Policy.

Powered by the version 2.3.6