RE: You will have to provide some more details of what you are looking for. Such as why is the number of libraries mentioned? Are your "sequences" actually data sets? And what areyou comparing to get the r-square? I must apologize for the lack of information as I was unsure as to the extent I should detail it. My data consists of two parts. The first is bacterial in nature (i.e. the sequence information) and the second is physiological (e.g. mortality and developmental status). I am looking to draw correlations between the presence/absence of specific bacteria and the rate of mortality and development. With honey bees there are only a hand full (five) of core bacteria of which several have been shown to form epithelial biofilms and other have been shown to cause a melanization of said cells. Although the data cannot be normalized for pearson's correlations, I would like to see how the abundance of a given bacteria correlates (relationship) to the physiological parameters I have data for. Thus I turned to the spearman's rho. This statistic however does not allow me to guage how much of the variation in Y (the physiological parameter) is explained by X (the bacterial parameter). I would like to be able to talk about the extent (r^2 or an alternative) that the bacterial abundance explains the physiological parameter it is being compared to. I must say here that most of the spearman correlations are monotonic in nature, however the relationships of interest to me are more linear in nature (goodness of fit tests before and after transformations < 0.03). That is they are not able to be normalized to meet the assumptions of normality but they have, graphically speaking, a linear relationship. It would be of benefit to me to be able to place a value on the amount of variation explained. For example, I have data about bacterium, found in my study in low and high abundance between different treatments, which has been shown to form a close relationship with the host epitheial cells (biofilm), and would like to show its correlation to mortality and deveopment at the different abundance levels. I hope this helps a bit. If not please let me know. Again, I am looking for a single value such as the r^2 to show how much of the variation in Y can be explained by X in a nearly linear relationship which is not able to be normalized. From what I have read (limited in most respects) that if the relationship in nearly linear one could use the spearman's rho for the p-value and the pearson's r^2 for the explaination of shared variance. Again, any help would be greatly appreacitaed. Best, pmaes
... View more