|
Estimates of genetic parameters for Holstein cows for test-day yield traits with a random regression cubic spline model
B.J. DeGroot1, J.F. Keown1, L.D. Van Vleck2 and S.D. Kachman1
1Department of Animal Science, University of Nebraska, Lincoln, NE, USA
2Roman L. Hruska U.S. Meat Animal Research Center, ARS-USDA, Lincoln, NE, USA
Corresponding author: J.F. Keown
E-mail: jkeown1@unl.edu
Genet. Mol. Res. 6 (2): 434-444 (2007)
Received March 14, 2007
Accepted May 25, 2007
Published June 30, 2007
ABSTRACT. Genetic parameters were estimated with restricted maximum likelihood for individual test-day milk, fat, and protein yields and somatic cell scores with a random regression cubic spline model. Test-day records of Holstein cows that calved from 1994 through early 1999 were obtained from Dairy Records Management Systems in Raleigh, North Carolina, for the analysis. Estimates of heritability for individual test-days and estimates of genetic and phenotypic correlations between test-days were obtained from estimates of variances and covariances from the cubic spline analysis. Estimates were calculated of genetic parameters for the averages of the test days within each of the ten 30-day test intervals. The model included herd test-day, age at first calving, and bovine somatropin treatment as fixed factors. Cubic splines were fitted for the overall lactation curve and for random additive genetic and permanent environmental effects, with five predetermined knots or four intervals between days 0, 50, 135, 220, and 305. Estimates of heritability for lactation one ranged from 0.10 to 0.15, 0.06 to 0.10, 0.09 to 0.15, and 0.02 to 0.06 for test-day one to test-day 10 for milk, fat, and protein yields and somatic cell scores, respectively. Estimates of heritability were greater in lactations two and three. Estimates of heritability increased over the course of the lactation. Estimates of genetic and phenotypic correlations were smaller for test-days further apart.
Key words: Random regression, Genetic parameters, Cubic spline
INTRODUCTION
There has been an increased interest in changing the type of data used for genetic evaluation of dairy cattle. Traditional models use data from test-day records combined into 305-day mature equivalent lactation records. The test-day model would use test-day records collected at various times during the lactation. The test-day model could provide some advantages compared to traditional models. These advantages would include: 1) an increased accuracy of genetic evaluations for yields, 2) direct and more precise adjustments for temporary environmental effects on test-days, 3) end-of lactation yields would not need to be extended for culled cows or for cows with records in-progress (Jensen, 2001), and 4) models could include the shape of the lactation curve for individual cows (Schaeffer and Dekkers, 1994). Test-day models tend to be more complex with more equations and parameters to be estimated, which is the main disadvantage compared to more traditional models (Jensen, 2001).
Various test-day models have been described in reviews by Swalve (2000), Misztal et al. (2000), Schaeffer et al. (2000), and Jensen (2001). These models have included a multiple-trait model with reduced rank, a repeatability model, a random regression model, and a covariance function model (Ptak and Schaeffer, 1993; Schaeffer and Dekkers, 1994; Wiggans and Goddard, 1997; Meyer and Hill, 1997). With the multiple-trait model, each test-day is modeled as a separate trait. Wiggans and Goddard (1997) suggested that the many test-day traits could be reduced to a few traits with a canonical transformation. With the repeatability model, test-day records within a lactation are considered to be repeated measures with fixed regression on days in milk, as defined by Ali and Schaeffer (1987). This model was later modified by Ptak and Schaeffer (1993) to adjust for test-day means at different stages of lactation.
Schaeffer and Dekkers (1994) and Jamrozik and Schaeffer (1997) extended the fixed regression model to a random regression model that was proposed by Henderson Jr. (1982). With such models, the shape of the lactation curve is modeled as a function of fixed effects. The random genetic and permanent environmental effects associated with an individual cow are modeled as deviations from the fixed lactation curve. Other authors developed functions that model lactation curves based on the natural shape of the lactation (e.g., Wilmink, 1987). Kirkpatrick et al. (1990, 1994) illustrated a method of estimating a matrix of coefficients for covariance functions with Legendre polynomials. Meyer and Hill (1997) demonstrated that models with a covariance function are equivalent to models with covariances among traits defined as a function of time or age.
White et al. (1999) described the use of smoothing cubic splines to model the lactation curve using test-day records. This random regression model consisted of fitting a series of cubic polynomials that are continuous and centered through knots or intervals along the lactation curve. White et al. (1999) explained that the spline function provided more flexibility to produce a "good" fit compared with polynomial functions. Another advantage of the spline models is the number of parameters that need to be estimated. The spline models need only four (co)variance parameters to be estimated (White et al., 1990), whereas the polynomial models require 0.5q(q+1) (co)variance parameters to be estimated, where q is the order of the polynomial (White et al., 1990).
We estimated genetic parameters for test-day milk, fat, and protein yields and somatic cell scores (SCS) for lactations one, two and three of Holstein cows, with a random regression, cubic spline model.
MATERIAL AND METHODS
Data
Test-day yields of Holstein cows that calved from 1994 through early 1999 were obtained from Dairy Records Management Systems of Raleigh, North Carolina. Each cow was required to have 2X per day milking, 305-day mature equivalent lactation yields, with at least eight test-day records. Lactation records were eliminated if days in milk was less than 200 days or greater than 350 days, if sire or dam identification was missing, lactation was initiated by abortion, or calving data were missing. Each test-day record was coded whether the cow was or was not treated with bovine somatotropin (bST). Only herds in which at least half of the cows received bST treatment were included in the analysis. Cows were considered bST-treated if the bST treatment started no later than test-day three and if bST treatment was coded for at least five consecutive test-days (coded 1 in analyses). Untreated cows were required not to have any bST treatment codes during the lactation (coded 0 in the analysis). Table 1 contains the number of test-day observations after edits that were used in the analysis for each trait and lactation combination. Fewer test-day records for fat and protein yields and SCS were available because some herds recorded only milk yield.

MODEL AND METHODOLOGY
A single-trait, random regression, cubic spline model was used to fit fixed lactation curves and deviations for each animal for both random genetic and permanent environmental components. The cubic spline model consists of a series of piecewise cubic polynomials that are defined for a set of pre-assigned interval along the lactation curve. The model is constrained so that the cubic spline function and its first two derivatives are continuous at the knots (breakpoints along the lactation curve), which determine the intervals (White et al., 1999).
Verbyla et al. (1999) and White et al. (1999) demonstrated how to incorporate the cubic spline function into the standard mixed model when the knots are assigned before the analysis. The model can be written as a random regression animal model:

where y is a vector of test-day yields or test-day SCS. The vector Β contains fixed effects, including fixed regression coefficients, and X is the incidence matrix for the fixed effects, which includes the bST code (0,1), herd test-day, covariate for age at the beginning of lactation, and a covariate for days in milk for each test-day record. The random effects are: s, a vector of overall spline parameters with length q-2; a1, a vector of genetic intercept (ai) and slope (asl) breeding value parameters for each animal of length 2m with m equal to the number of animals; as, a vector of spline breeding value parameters for the cubic spline function for each animal with length (q-2) · m; pe1, a vector of permanent environmental intercept (pei) and slope (pesl) parameters with length 2p with p equal to the number of levels of factors; pes, a vector of permanent environmental parameters for the spline function with length (q-2)p, and e, a vector of residual effects. The matrices Wa and Wpe are the incidence matrices of the linear coefficients for animal genetic and permanent environmental effects, and Zs, Za, and Zpe are the incidence matrices of the spline coefficients for overall spline, animal genetic, and permanent environmental parameters of the spline function based on the number of predetermined knots. The distributions of the random effects are defined as:

where D is an identity matrix of dimensions (q-2) x (q-2), I are the identity matrices of appropriate order, and A is the animal numerator relationship matrix.
The analysis was done using ASREML to estimate (co)variance components (Gilmour et al., 1997). The predetermined knots were at days 0, 50, 135, 220, and 305. Convergence was presumed when the REML log-likelihood changed less than 0.002 from the previous iteration and the individual variance parameter estimates changed less than 1%. The analysis was restarted after the first convergence until the log-likelihood value was considered converged.
RESULTS AND DISCUSSION
Test-day milk yield
Estimates of genetic, permanent environmental, and phenotypic variances for test-day milk yields for the first three lactations are in Table 2, as calculated for the midpoints of the test-day intervals. Table 3 contains the estimates of heritabilities for the first three lactations. Test-day milk yields had estimates of heritability that ranged from 0.10 to 0.15, 0.10 to 0.18, and 0.09 to 0.17 for lactations one, two, and three, respectively. Estimates of heritability increased steadily from test one to test 10. Estimates in later lactations were greater than for lactation one. The estimates of heritability were less than estimates reported by White et al. (1999), who used a random regression cubic spline model, Tijani et al. (1999), who used a random regression model using Legendre polynomials covariance functions, and were much smaller than estimates by Jamrozik and Schaeffer (1997), who used a random regression model with functions of ratios of days and the natural logarithm of days in milk. The estimates were similar to estimates reported by Gengler et al. (2001), using a random regression model with Legendre polynomials as the covariance function for lactation one. Estimates were less compared to later lactations.


For the three lactations, the overall genetic variance decreased from test one to test three and then gradually increased over the course of the lactation. The permanent environmental variances were variable during the early stages of lactations and were relatively constant during the mid and later stages of lactations. The estimates of genetic correlations ranged from 0.34 to 0.98 for lactation one and were similar for later lactations. The estimates of genetic and phenotypic correlations were high between test-day milk yields on test-days close together compared with yields for test-days that were more days apart. Estimates are similar to those reported in previous studies (Tijani et al., 1999; White et al., 1999; Gengler et al., 2001).
Test-day fat and protein yields
Estimates of genetic, permanent environmental, and phenotypic variances for the first three lactations are in Tables 4 and 5 for test-day fat and protein yields, respectively. Tables 6 and 7 contain estimates of heritabilities for test-day fat and protein yields, respectively. Test-day fat yields had estimates of heritability that ranged from 0.06 to 0.10, 0.05 to 0.16, and 0.06 to 0.15 for lactations one, two, and three, respectively. Test-day protein yields had estimates of heritability that ranged from 0.09 to 0.15, 0.08 to 0.16, and 0.07 to 0.15 for lactations one, two, and three, respectively. Estimates of heritability for test-day fat and protein yields increased steadily over the course of the lactations. The estimates of heritability for test-day fat and protein yields were less than estimates reported by Tijani et al. (1999) and Gengler et al. (2001), who used a random regression model with Legendre polynomials as covariance functions and were much less than estimates by Jamrozik and Schaeffer (1997), using a random regression model with functions of ratios of days and natural logarithm of days in milk.




Estimates of overall genetic variances for test-day fat and protein yields increased from the early to mid stages of lactation and remained constant from mid to later stages of lactation. Estimates of overall permanent environmental variances decreased slightly over the lactation. The estimates of overall genetic variance were nearly constant during the early part of the lactation and increased during the mid and later stages of lactation. Estimates of permanent environmental variances were variable during the early stages of lactation and remained constant during the mid and later stages of lactation. The estimates of genetic correlations ranged from 0.49 to 0.97 and 0.36 to 0.99 for lactation one test-day fat and protein yields, respectively. The estimates of genetic correlations were similar for the later lactations. Estimates of correlations were high between test-day fat and protein yields on test-days close together compared with yields on test-days that were more days apart. These estimates are similar to those reported in previous studies (Tijani et al., 1999; Gengler et al., 2001).
Somatic cell scores
Estimates of genetic, permanent environmental, and phenotypic variances for test-day SCS for the first three lactations are in Table 8. Table 9 contains the estimates of heritabilities for the first three lactations. Test-day SCS had estimates of heritability that ranged from 0.02 to 0.06, 0.04 to 0.04, and 0.03 to 0.06 for lactations one, two, and three, respectively. The estimates of heritability were less than estimates reported by Haile-Mariam et al. (2001) for a random regression model with a second-order polynomial.


The estimates of overall permanent environmental variance decreased and then increased during the early stage of lactation and were relatively constant during the mid and later stages of lactation. The estimates of genetic correlations ranged from 0.83 to 0.99 for lactation one and were similar for the later lactations. Estimates of genetic correlations among test-day SCS were high between test-day SCS on test-days close together compared with scores on test-days that were farther apart. These estimates are similar to those reported by Haile-Mariam et al. (2001).
CONCLUSIONS
The cubic spline model provided flexibility for estimating genetic parameters from test-day yields and SCS. The flexibility of the model extends to estimating genetic and permanent environmental (co)variances (White et al., 1999). Estimates of heritability increased as days in milk increased for all lactations for test-day yields and SCS. Estimates of heritability were less than previous estimates reported with other types of random regression models. The smaller estimates could be due to the type of data set used in the analysis. This data set contained more grade cows than registered cows. Estimates of genetic parameters are usually lower for grade cows compared to registered cows, which may be caused by a greater chance of misidentification of sires and dams for grade cows.
Estimates of genetic and permanent environmental variances for test-day yields and SCS were higher for lactations two and three than for lactation one. Lactation two had estimates of variances due to genetic and permanent environmental effects in the spline function that were more variable than estimates for lactations one and three. Estimates of genetic and phenotypic correlations decreased with an increase in days between when the yields were measured.
The cubic spline model may be a suitable method of estimating genetic parameters over the course of the lactation. In our study, the estimates of genetic parameters with the cubic spline model were comparable to estimates found with other methods. The major advantage of this method is the smaller number of variance components that need to be estimated, when compared with polynomial and multiple trait methods. Further research would need to be done to determine the proper number and placement of the knots for days in milk and comparison of computational time needed to set up and solve equations for other methods used to estimate genetic parameters.
REFERENCES
Ali TE and Schaeffer LR (1987). Accounting for covariances among test day milk yields in dairy cows. Can. J. Anim. Sci. 67: 1233-1240.
Gengler N, Tijani A, Wiggans GR and Philpot JC (2001). Indirect estimation of (co)variances functions for test-day yields during first and second lactations in the United States. J. Dairy Sci. 84 (Online): 542-571.
Gilmour AR, Thompson R, Cullis BR and Welham SJ (1997). ASREML Manual. New South Wales Department of Agriculture, Orange.
Haile-Mariam M, Goddard ME and Bowman PJ (2001). Estimates of genetic parameters for daily somatic cell count of Australian dairy cattle. J. Dairy Sci. 84: 1255-1264.
Henderson CR Jr (1982). Analysis of covariance in the mixed model: higher-level, nonhomogeneous, and random regressions. Biometrics 38: 623-640.
Jamrozik J and Schaeffer LR (1997). Estimates of genetic parameters for a test day model with random regressions for yield traits of first lactation Holsteins. J. Dairy Sci. 80: 762-770.
Jensen J (2001). Genetic evaluation of dairy cattle using test-day models. J. Dairy Sci. 84: 2803-2812.
Kirkpatrick M, Lofsvold D and Bulmer M (1990). Analysis of the inheritance, selection and evolution of growth trajectories. Genetics 124: 979-993.
Kirkpatrick M, Hill WG and Thompson R (1994). Estimating the covariance structure of traits during growth and ageing, illustrated with lactation in dairy cattle. Genet. Res. 64: 57-69.
Meyer K and Hill WG (1997). Estimation of genetic and phenotypic covariance functions for longitudinal or ‘repeated’ records by restricted maximum likelihood. Livest. Prod. Sci. 47: 185-200.
Misztal I, Strabel T, Jamrozik J, Mantysaari EA, et al. (2000). Strategies for estimating the parameters needed for different test-day models. J. Dairy Sci. 83: 1125-1134.
Ptak E and Schaeffer LR (1993). Use of test day yields for genetic evaluation of dairy sires and cows. Livest. Prod. Sci. 34: 23-34.
Schaeffer LR and Dekkers JCM (1994). Random regressions in animal models for test-day production in dairy cattle. Proceedings of the 5th World Congr. Genet. Appl. Livest Prod., Guelph, 433-446.
Schaeffer LR, Jamrozik J, Kistemaker GJ and Van Doormaal BJ (2000). Experience with a test-day model. J. Dairy Sci. 83: 1135-1144.
Swalve HH (2000). Theoretical basis and computational methods for different test-day genetic evaluation methods. J. Dairy Sci. 83: 1115-1124.
Tijani A, Wiggans GR, Van Tassell CP and Philpot JC (1999). Use of (co)variances functions to describe (co)variances for test day yield. J. Dairy Sci. 82 (Online): 226-240.
Verbyla AP, Cullis BR, Kenward MG and Welham SJ (1999). The analysis of designed experiments and longitudinal data by using smoothing splines. Appl. Statist. 48: 269-311.
White IM, Thompson R and Brotherstone S (1999). Genetic and environmental smoothing of lactation curves with cubic splines. J. Dairy Sci. 82: 632-638.
Wiggans GR and Goddard ME (1997). A computationally feasible test day model for genetic evaluation of yield traits in the United States. J. Dairy Sci. 80: 1795-1800.
Wilmink JBM (1987). Adjustment of test-day milk, fat, and protein yields for age season and stage of lactation. Livest. Prod. Sci. 16: 335-348.
|