Kudos for a well-crafted question. I wish I had the answers to all of your questions.
1) Is the baseline covariate parameterized correctly in the model statement? Do I need both the NO3 cov and NO3cov*Sample_Date to account for the fact that the relationship of the response variable to the covariate changes over time?
I can see the value in using (the mean of the first four dates of) NO3cov as a covariate. I think it would make sense to use log(NO3cov) if you are using a lognormal distribution for Concentration_NO3. The model assumes a linear relationship between the link-scale of the response and the covariate.
NO3cov*Sample_Date allows the slope of the linear regression of Concentration_NO3 (on the link scale) to vary by Sample_Date. You should retain NO3cov if the interaction is in the model statement.
2) Do I need to change or include an additional random statement when I add the covariate? will the df be correct for the covariate effect with the specification that I have?
Adding a covariate into a mixed model is an appreciable complication, depending on the hierarchical level at which the covariate is measures (here, Rep*Irr_Trt*N_Trt, aka subplot); check out random coefficient models in the Littell et al. text (SAS for Mixed Models, 2nd ed), Stroup (Generalized Linear Mixed Models), and Milliken and Johnson (Analysis of Messy Data, Vol III Analysis of Covariance).
I always go into a model with some idea of what I think are appropriate denom df, just to check.
3) How does the inculsion of the covariate affect the existing repeated measures covariance structure?
Not sure about that.
4) Is this even an appropriate use of a covariate?
Google "change in baseline ancova".
5) Is it a "problem" that the main effect of NO3cov is not significant while the interaction effect of NO3cov*Sample_Date is? What about the increase in AICC value when the covariate was inculded?
Nope. Google "ancova centering". This text
https://books.google.com/books/about/Multiple_Regression.html?id=LcWLUyXcmnkC
has a nice introductory level discussion of centering.
Regarding the AICC increasing with the inclusion of the covariate: To compare models that differ in fixed effects factor, you need to use a true maximum likelihood method (LAPLACE or QUAD), rather than REML.
There are many possible different bells and whistles to be considered. Is the relationship linear? Does the slope vary with Irr_Trt and/or N_Trt? Is the slope constant for all whole plots, or does it vary?
Stepping back further, I would ask: What is your research question about sampling dates? 21 or 25 levels is a lot to sort out in an ANOVA context. Why did you make all these observations? Are you actually interested in comparing means among all 21 or 25? Would it make sense to extract some agronomically meaningful statistic from all these measurements to use as a response, or to regress on sampling date? Is there a statistician at your institution that you could work with, or a statistics prof, or a statistical consulting center? (I know, there may not be anyone, or at least anyone that knows more than you.) Lots of things to ponder, and a good stat colleague would be a more useful thing than this forum.
... View more