Issue 
A&A
Volume 636, April 2020



Article Number  A70  
Number of page(s)  18  
Section  Planets and planetary systems  
DOI  https://doi.org/10.1051/00046361/201937412  
Published online  20 April 2020 
Mitigating flicker noise in highprecision photometry
I. Characterization of the noise structure, impact on the inferred transit parameters, and predictions for CHEOPS observations
^{1}
Space Research Institute, Austrian Academy of Sciences,
Schmiedlstr. 6,
8042
Graz, Austria
^{2}
Aix Marseille Univ., CNRS, CNES, LAM,
Marseille,
France
email: sophia.sulis@lam.fr
^{3}
Observatoire de l’Université de Genève,
51 chemin des Maillettes,
1290
Sauverny,
Switzerland
^{4}
Institute of Physics, University of Graz,
Universitätsplatz 5,
8010
Graz, Austria
^{5}
Kanzelhöhe Observatory for Solar and Environmental Research, University of Graz,
Kanzelhöhe 19,
9521
Treffen, Austria
^{6}
Space sciences, Technologies and Astrophysics Research (STAR) Institute, Université de Liège,
19C Allée du sixaoût,
4000
Liège,
Belgium
Received:
24
December
2019
Accepted:
9
March
2020
Context. In photometry, the shorttimescale stellar variability (“flicker”), such as that caused by granulation and solarlike oscillations, can reach amplitudes comparable to the transit depth of Earthsized planets and is correlated over the typical transit timescales. It can introduce systematic errors on the inferred planetary parameters when a small number of transits are observed.
Aims. The objective of this paper is to characterize the statistical properties of the flicker noise and quantify its impact on the inferred transit parameters.
Methods. We used the extensive solar observations obtained with SoHO/VIRGO to characterize flicker noise. We simulated realistic transits across the solar disk using SDO/HMI data and used these to obtain transit light curves, which we used to estimate the errors made on the transit parameters due to the presence of real solar noise. We make these light curves publicly available. To extend the study to a wider parameter range, we derived the properties of flicker noise using Kepler observations and studied their dependence on stellar parameters. Finally, we predicted the limiting stellar apparent magnitude for which the properties of the flicker noise can be extracted using highprecision CHEOPS and PLATO observations.
Results. Stellar granulation is a stochastic colored noise, and is stationary with respect to the stellar magnetic cycle. Both the flicker correlation timescales and amplitudes increase with the stellar mass and radius. If these correlations are not taken into account when fitting for the parameters of transiting exoplanets, this can bias the inferred parameters. In particular, we find errors of up to 10% on the ratio between the planetary and stellar radius (R_{p}∕R_{s}) for an Earthsized planet orbiting a Sunlike star.
Conclusions. Flicker will significantly affect the inferred parameters of transits observed at high precision with CHEOPS and PLATO for F and G stars. Dedicated modeling strategies need to be developed to accurately characterize both the star and the transiting exoplanets.
Key words: techniques: photometric / planetary systems / stars: activity / Sun: granulation / methods: statistical
© ESO 2020
1 Introduction
The following decade will see the outcome of several missions in the field of extrasolar planets. With the new space missions like the Transiting Exoplanet Survey Satellite (TESS; Ricker et al. 2015), the Characterizing Exoplanet Satellite (CHEOPS; Fortier et al. 2014), and the Planetary Transits and Oscillations of stars mission (PLATO; Rauer et al. 2014), we expect to be able to detect and precisely characterize several thousands of new transiting Neptune to Earthlike planets. However, as it was already the case with bright stars observed with Kepler (Gilliland et al. 2011, 2015), the highprecision photometry of these instruments will not be limited by photon noise but by stellar variability. Indeed, at the stellar surfaces, several phenomena (e.g., spots, plages, flares, convection, oscillations) evolve on different timescales and generate variability that degrades the detection and the shape determination of planetary transits (see e.g., Oshagh 2018 for a recent review).
In this paper, we focus on the stellar variability taking place on timescales similar to the duration of a single planetary transit (i.e., <1 day). On these timescales, the dominant stellar activity contribution of quiet stars of around solar mass comes from the surface convective motions and the pressuremode oscillations. In the case of exoplanet transits, star spot crossings can also punctually contribute to the shorttimescale noise. In this study however, we disregard these punctual noise sources as they are not regularly present in the observations, and their contribution can be averaged out by analyzing hundreds of short individual time series.
Oscillations are studied extensively through asteroseismology. Pressuremode oscillations are generated in the stellar convective envelope of stars of solar mass and allow us to probe the stellar interiors. Their characteristics (amplitudes and periods) directly inform on the evolutionary stage of the star. Solartype stars oscillate with a period of several minutes and generate a photometric background signal with an amplitude of several tens of parts per million (ppm; Harvey 1988). The amplitudes and frequencies of these pmodes are known to change with the stellar magnetic cycle (Chaplin et al. 2011; Salabert et al. 2011; García et al. 2013).
Convection results from turbulent plasma motion at the stellar surface. When resolved, ascending hot plasma surfacing at the granules appears bright, and cool plasma descending in the intergranular lanes appears darker, leading to variable contrasts in brightness over time. The individual granulation cells can only be resolved for the solar surface, where the individual cells have average sizes of 1000 km (Title et al. 1989) and a median “turnover” timescale between 7 and 10 min (Nesis et al. 2002). However, granulation is an evolving process: the size of the granules grows and shrinks with time, the cells merge and split with the surrounding granules, and the resulting photometric variability appears correlated over timescales larger than the turnover period (Seleznyov et al. 2011). At large scales (~ 3 × 10^{7} m), we can also observe a conglomerate of convection cells when mapping the magnetic flow on the solar surface.
Thanks to the Solar and Heliospheric Observatory (SoHO) measurements, the photometric signature of solar granulation is known to be approximately 100 ppm (Dravins 1988; Fröhlich et al. 1997; Aigrain et al. 2004). The typical amplitudes and turnover timescales of surface granulation depend on the stellar parameters. Amplitudes increase with effective temperature (and possibly thestellar metallicity; see Corsaro et al. 2017) and decrease with decreasing stellar mass and/or increasing surface gravity (Svensson & Ludwig 2005). Both decrease with increasing mean oscillation frequency ν_{max} (Dravins 1988; Kallinger et al. 2014). Recently, highprecision Kepler measurements have made it possible to measure the photometricamplitude of the granulation variability in increasing detail, which has given rise to a new technique for deriving the stellar surface gravity (Mathur et al. 2011; Bastien et al. 2016; Cranmer et al. 2014; Pande et al. 2018) and density (Kipping et al. 2014).
While granulation contains extensive information for stellar physics, because of its stochastic nature it is typically considered as noise from the point of view of detecting and/or characterizing planets and their atmospheres. In an exoplanetary context, granulation is often referred to flicker noise as its power spectrum follows the form of a powerlaw function in a specific frequency range related to its turnover timescales. In practice, because the granulation turnover timescale is small (≪ 1 day) for solartype stars, this stellar variability can be considered uncorrelated from one planetary transit to another. Therefore, the influence of this noise can be reduced when several transit events are observed and the overall signaltonoise ratio (S/N) can be improved byphasefolding the planetary transits with the planet orbital period. However, this technique is not efficient when only a small number of transits are observed (e.g., for long period planets) or when the transits show transit timing variations (multiplanetary systems). Hence, the statistical properties of the flicker noise need to be known in order to design dedicated modeling strategies correctly accounting for (and possibly reducing) its influence when estimating the parameters of transiting exoplanets.
Granulation noise is often modeled using Harvey law profiles (Harvey 1985) with parameters (amplitude and timescales) estimated based on the observed stellar power spectral density (PSD; Pallé et al. 1999; Kallinger et al. 2014; Cranmer et al. 2014). One success of this technique is the relation between the parameters of the Harvey law profiles and the stellar fundamental parameters, which appear to be correlated (Kallinger et al. 2014). Recently, Pereira et al. (2019) proposed to couple Gaussian Processes (GP) noise modeling (Rasmussen & Williams 2005) with PSDs based on Harveylike profiles and showed that taking into account the flicker correlations within the transit analysis tends to improve the accuracy of the inferred transit parameters. Barclay et al. (2015) came to a similar conclusion using other GP noise models to fit for granulation and derive the parameters of the transiting hot Jupiter Kepler91b. A more realistic way to model stellar granulation is to turn to modern threedimensional radiative hydrodynamical simulations of stellar convection (Nordlund et al. 2009). While computationally demanding, these codes allow the generation of realistic photometric time series of flicker noise and could provide valuable diagnostics to extract the flicker properties affecting transits (Chiavassa et al. 2017). This has been shown by Chiavassa et al. (2015) with the transit of Venus in 2004.
In this work, we aim to understand the effect of stellar flicker on determining accurate and precise properties of exoplanets through transit photometry and describe how this noise behaves as a function of the stellar parameters. This paper is organized as follows. In Sect. 2, we summarize the statistical properties of solar flicker using 21 yr of continuous solar observations provided by SoHO. In Sect. 3, we analyze the impact of solar flicker on the determination of the transit parameters. In Sect. 4, we use shortcadence Kepler observations to extract the flicker properties and show their dependence with the stellar parameters. In Sect. 5, we discuss the potential for the photometric characterization of this noise source with the future highprecision observations of CHEOPS and PLATO. We conclude in Sect. 6.
2 Statistical characterization of shorttimescale solar variability
This section summarizes the statistical properties of solar granulation in photometric observations. We aim to list the properties that are needed by signal processing routines to analyze the influence of this noise source and derive appropriate noise models.
2.1 VIRGO observations
The Sun has been monitored since 1996 by the ESA/NASA SoHO spacecraft. Onboard, the Variability of solar IRradiance and Gravity Oscillations (VIRGO) experiment measures the spectral irradiance with a threechannel sun photometer (SPM) of 5 nm bandwidth at wavelengths of 402 (blue), 500 (green), and862 (red) nm. Observations at these wavelengths span different heights in the solar photosphere (from − 20 km compared to the base of the solar photosphere for the green and blue channels to + 10 km for the red, Jiménez et al. 2005). The data are integrated over 60 s and centered around the full minute. The duty cycle of the 21 yr of VIRGO time series (from April 11, 1996, to March 30, 2017) is around 96%, making it the best data set available today to derive the statistical properties of the solar variability. The photon noise in each of the three channels is below 10 ppm (Salabert et al. 2017). A full description of the instrument’s characteristics and technical calibration procedures can be found in Fröhlich et al. (1995, 1997), and Jiménez et al. (2002).
The available data (level 1) have been converted to physical units (W m^{−2} nm^{−1}), corrected bytemperature variations, and calibrated to a constant distance between the spacecraft and the Sun. We carefully corrected for the instrumental degradation^{1} evolving over time following the procedure below:
 1.
For computational reasons, we split the whole data set into subseries, each spanning 365 days.
 2.
We smoothed each subseries using a running average of 3 days in length and localized the 3σ outliers. These outliers were disregarded in step 3.
 3.
We fitted a highdegree polynomial function to the smoothed time series and used it to normalize the initial time series (containing the outliers).
 4.
We removed the 5σ outliers from each corrected time series,
 5.
We finally compared the resulting three SPM datasets and kept the common data points between these three series.
As the emergent flux decreases from optical to infrared, we observe a strong color dependence between the datasets, with higher variability at short (blue) wavelengths than at long (red) wavelengths (see also Aigrain et al. 2003). This is illustrated in Fig. 1 through the time series rootmeansquare (RMS) measurement, where the signature of the approximately elevenyear solar cycle is evident. To study the shorttimescale solar variability, we finally divided the detrended datasets into oneday subseries, and removed those subseries that had missing data and strong instrumental features. We obtained a total of 5912 regularly sampled oneday subseries (i.e., ≈275 subseries yr^{−1}). We use this dataset to characterize the properties of the granulation activity in the following subsections.
Fig. 1
Yearly RMS of the VIRGO time series for the blue, green, and red SPM channels (see colors). Solar minima at the beginning ofcycles 23 (from 1996 to 2008) and 24 (since 2009) are clearly visible. 
2.2 A stationary stochastic colored noise
The solar shorttimescale variability (periods < 1 day) is dominated by instrumental noise, oscillation modes, and convection. The distinct signatures of these three noise sources can be well discerned in the PSD of the VIRGO observations. Figure 2 shows an estimate of this PSD through the averaged periodogram defined as (Bartlett 1950)^{2}: (1)
with L being the number of available oneday subseries, , per year, N the number of data points of these subseries, and t_{j} = j ×dt the time of the measurements with j = 0, …, N. Also, dt is the temporal sampling and are the Fourier frequencies^{3} computed for k = 0, …, N − 1. We observe a strong frequency dependence of the PSD due to correlations over several timescales. This frequency dependence is characteristic of a noise that is defined as a colored noise, in opposition to a white Gaussian noise (WGN) that shows a constant PSD over the whole frequency range^{4}. The periodic signatures of solar oscillations are clearly visible around 3000 μHz. Granulation dominates the frequency range corresponding to a typical transit duration (gray area): between activity and oscillations. As long as the planet transit does not cross a spot or plage, this may be the dominant stellar noise source in highprecision photometric observations. By comparing the averaged periodograms computed using subseries taken during a minimum and a maximum of the solar cycle (see dashed and solid lines, respectively), we only note significant differences at low frequencies (ν < f_{ℓ} = 44.9 μHz), in agreement with Seleznyov et al. (2011). Although not visible in this comparison, we also note the presence of small changes of the pmodes signatures (both in amplitude and frequency) over the magnetic cycle, in line with findings by Chaplin et al. (2011), Salabert et al. (2011), and García et al. (2013). Interestingly, the frequency region that is dominated by the granulation variability, ν ∈ [f_{ℓ}, 2300] μHz, is not significantly affected by the solar magnetic cycle (see Seleznyov et al. 2011; Muller et al. 2018). Therefore, we can state that granulation variability is stationary with respect to the solar cycle (i.e., its statistical properties remain constant over the solar cycle).
Fig. 2
Averaged periodograms computed with Eq. (1) using all oneday subseries available during a minimum (2008, solid dashed lines) and a maximum (2003, solid solid lines) of the solar cycle. Each color indicates the observation of one SPM channel. Differences with the solar cycle are observed at frequencies ν < f_{l} (i.e., at periods >6.2 h). The peak at ν = 5570 μHz affecting the PSD of the blue and green SPM data is an electronic artifact related to the calibration period used by the acquisition system of VIRGO. The gray shaded area indicates the frequency range corresponding to the typical duration of planetary transit (from ~ 25 min to several hours). 
Fig. 3
Example of a oneday VIRGO time series (top) detrended (bottom) using a SG filter with a window size of 15 h (colored solid lines). From left to right: dataset from the blue, green, and red SPM channels. 
2.3 Time domain analysis: amplitude distributions
By nature, the influence of a stochastic process cannot be known exactly. Consequently, it is only possible to make a probabilistic statement about its behavior. Here, to characterize the shorttimescale variability in the time domain, we analyze how its amplitudes are distributed. By “amplitude”, here we mean the flux offset from the mean value of a given oneday subseries.
To do so, we first eliminated lowfrequency noise by applying a SavitzkyGolay (SG) filter (Savitzky & Golay 1964), parametrized by a window length of 15 h and a polynomial degree of 3. This allowed us to consider only the frequency range ν < 2 × f_{ℓ}. We note that, to avoid an inappropriate effect of the SG filter at the edges of the oneday subseries, we applied this lowfrequency noise removal to the whole oneyear time series (i.e., before dividing it into daily subseries as described in Sect. 2.1). This step is illustrated in Fig. 3, where we observe the wavelength dependence of these variations, which can reach several tens (red channel) to hundreds (blue channel) of parts per million (Dravins 1988). The amplitudes of the flicker noise therefore increase when observing deeper layers of the solar photosphere (Jiménez et al. 2005). Intuitively, we can expect this variability to follow a Gaussian distribution. Indeed, if we assume the dominant source of variability comes from convective motions, then this variability results from a multitude (~ 10^{6}) of independentstochastic granules, each generating photometric variability of a similar amplitude. This is a physical manifestation of the central limit theorem. To validate this hypothesis, we performed a ShapiroWilk normality test (Shapiro & Wilk 1965) on each solar oneday subseries. This results in the nonrejection of the Gaussian hypothesis for more than 95% of our dataset. Without establishing a formal proof of Gaussianity, this test does not argue against this hypothesis. In this approximation, we can therefore state that flicker noise is Gaussian (i.e., the amplitudes of the noise follow a Gaussian distribution). We note that Gaussianity here refers to the probability distribution of the amplitude of the flicker but does not inform on the way the power of this noise is distributed in the frequency domain (correlations). The latter remains, a priori, unknown (see Sect. 2.4). Gaussianity is important as it implies that noises correlated over timescales of minutes to hours (mostly convection processes) can be completely defined by the two first moments of their distribution: the mean and power spectral density (related to the variance, Simon 2006).
Other common measurements to characterize a stochastic noise are the RMS and the amplitude range. For granulation, the “eighthour flicker” measure (or “F8”), corresponding roughly to the RMS evaluated over a timescale between 0.5 and 8 h, is also often computed. To compute the F8 measurement as described in Bastien et al. (2016), we binned the solar oneday subseries into intervals of 30 min (to mimic the Kepler longcadence observations) and used a 16point boxcar filter^{5} to remove the longterm activity over periods >8 h. Figure 4 shows the distribution of these three quantities evaluated on the oneday VIRGO subseries taken over a year. In each panel, the distributions obtained using subseries taken during a solar cycle minimum (resp. solar cycle maximum) are shown by the colored (resp. black) histograms. Typical RMS values are 30− 40 ppm (red), 50− 60 ppm (green), and 70− 90 ppm (blue). Extreme values can reach several hundred ppm (200, 350, and 500 ppm for the red, green and blue channels, respectively). Although comparable to the global RMS, the F8 values are slightly smaller because they are based on data binned into intervals of 30 min and thus encapsulate less of the signal due to granulation. By comparing these values with the transit depth expected for an Earth analogue (see vertical lines), we gather that this variability can easily impede the transit detection or bias the inferred parameters.
In practice, binning the light curve of a single transit to decrease the impact of granulation noise is not optimal because of the long timescale of the stellar convection (Meunier et al. 2015). This is shown in Fig. 5, which illustrates the slower decrease of the RMS of the solar flicker noise compared to purely white noise. For highprecision photometric data, the shorttimescale variability cannot be approximated as a WGN. Instead, convection processes produce a colored stochastic noise. We note that acoustic modes are better described as deterministic noise sources (in the sense that they only affect specific frequencies depending on the stellar properties).
2.4 Frequency domain analysis: the solar power spectra
The total dispersion of the time series is a scaled sum of the power at each frequency component (Parseval’s identity, Li 2014, see Eq. (6.1.7), p. 169). As discussed in Sect. 2.2, the solar PSD exhibits correlations over a multitude of timescales. For a given characteristic timescale, one noise process may dominate the others.
Acoustic mode timescales
To localize the typical upper and lower frequency bounds of the acoustic modes, we use the autocorrelation function (ACF) of the observations, similarly to Kallinger et al. (2016). The first zerocrossing of the ACF is the autocorrelation time (t_{c}), that is, the time interval over which the noise decorrelates. For the solar time series, t_{c} corresponds to the corner frequency (f_{c}) and indicates the lower frequency of the pmodes (the separation between the regime dominated by the deterministic pmodes and those dominated by the stochastic convection). The upper limit of the pmode signature (f_{h}) can be identifiedby the first dip in the ACF. To precisely evaluate these frequencies, we computed the ACFs of each oneday subseries taken during one year and averaged them. The averaged ACF is shown in the main panel of Fig. 6 (black), showing f_{h} = 4000 μHz (~ 4 min) and f_{c} = 2000 μHz (~ 8 min).
Fig. 4
From left to right: empirical distribution of the RMS, amplitude range, and F8 measurements evaluated on a set of oneday VIRGO subseries taken during a solarcycle minimum (2008, colors are related to the three SPM channels). The empirical distributions obtained during a solarcycle maximum (2003) are shown by the black contours. Distributions derived using the HMI data described in Sect. 3 are shown by the yellow histograms.In each panel, the vertical dashed line indicates the transit depth expected for an Earth analogue (84 ppm). 
Fig. 5
Effect of data binning on the RMS of oneday solar subseries (1996, black), on the synthetic times series generated with Eq. (3) (yellow), and on synthetic subseries of WGNs (blue). The RMS of the WGN series decreases as (light blue). Solid lines indicate the median values of the observed dispersion (shaded areas). The RMS of the time series at large bin sizes shows a significant dispersion as the number of data points decreases. Horizontal dotted lines indicate the fraction of a typical Earthlike transit depth (i.e., 84 ppm). 
Convection timescales
The PSD of the VIRGO observations shows a “knee” shape in the frequency region ν < f_{c} (see inset panel of Fig. 6). The physical origin of this kink is debated and is not observed in radial velocity measurements. Works based on Harveyprofile functions (Harvey 1985) that fit for the stellar background generally attribute this kink to the regime where supergranulation or mesogranulation^{6} noise dominate. However, based on synthetic simulations of solar granulation alone, Seleznyov et al. (2011) attributed this frequency to the large dispersion of the granulation turnover timescales (the distribution of the granule lifetimes varies from 0 to 30 min). We estimate this frequency (that we called the flicker frequency in this work) at f_{g} = 555 μHz (30.3 min). We finally set the limit between the convection and the activity regimes (constrained here by the lowfrequency SG filter with a width of 15 h) at f_{ℓ} = 44.9 μHz (6.2 h).
According to Seleznyov et al. (2011), the physical properties of the stellar granule cells, such as the number of cells over the visible stellar surface, their photometric contrast ratio, their lifetime, and average size, can be extracted from the study of the observed stellar PSD. For example, we expect the frequency at which the PSD starts to become “flat” (i.e., f_{g}) to decrease with the increase of the granulation median turnover timescale. This would make this frequency a potential tracker for deriving the lifetime distribution of the granule cells over the stellar surface. Furthermore, the amplitude of the stellar PSD can be related to the average size of the granule cells. We should expect a decrease of the PSD amplitudes with the decrease of the size of the granules, as the ratio between bright granules and intergranular lanes decreases (Seleznyov et al. 2011).
In practice, the stellar PSDs are often modeled with Harvey functions (Harvey 1985). As discussed in Karoff (2012), the choice of this model is empiric and various slightly modified Harvey profiles are found in the literature (see e.g., Harvey 1985; Pallé et al. 1999; Aigrain et al. 2004; Kallinger et al. 2014; Cranmer et al. 2014; Pereira et al. 2019). The difference between these models mainly comes from the choice of the powerlaw exponent, chosen to fit the observed stellar PSD (Mathur et al. 2011). Consequently, we chose here to represent the noise correlations by using a simpler model based on 1∕ν^{α} power laws (Pallé et al. 1999). We refer to this model as the flicker model in the following. Written in logarithmic scale, the model is: (2)
with P_{L} being the yearly averaged periodogram of the oneday solar subseries defined as in Eq. (1), α the power index, β a constant, and the frequencies in the regions as labeled in Fig. 5, corresponding to the different regimes outlined above. The power index α and the constant offset β have to be determined for each frequency region .
We then estimated the parameter set given in Eq. (2) through a leastsquare minimization of the solar PSDs. The average values are listed in Table 1 and the estimated power indices for each of the 21 yr of VIRGO observations are shown in Fig. 7.
As visible in the PSD, the highest values of the power index α are found for the highest frequency region (A). For all regions considered, the power indices observed in the red channel appear relatively constant with time. For the green and blue channels however, which are more strongly influenced by the detector ageing, the power index drops significantly in recent years (> 2008) in the highfrequency regions A and B. We note that this effect is likely purely instrumental due to increasing white noise. For the timescales inferior to several hours (regions A, B and C), the index values do not vary along the solar cycle but they do for region D. Once again, these results illustrate both the global high quality of VIRGO observations and the stationarity of the flicker variability. Moreover, for ν < f_{c} (regions C and D), the similarities between the time series observed in different colors indicate similar correlations independently of wavelength. Consequently, alternatively to Harvey models, a flickerbased model can be designed based on simple power laws as in Eq. (2). Synthetic time series of this stationary stochastic process can then be generated as: (3)
with S(ν) = exp(β∕ν^{α}) being the parametric noise PSD based on Eq. (2), t the time of the observations, δν the sampling frequency, and ϕ the random phase ∈ [0, 2π]. Following Eq. (3), we generated synthetic time series based on parameters listed in Table 1. We compared these synthetic series with solar observations in Fig. 5. As soon as the number of data points is sufficiently large (i.e., at bin sizes <3 h), we observe a similar behavior for both series demonstrating that our simple flickernoise model is realistic enough to provide a firstorder approximation of shorttimescale solar variability.
Fig. 6
Estimation of the splitting frequencies. Main panel: L = 364 ACFs corresponding to the available solar oneday subseries observed in 1996 with the blue SPM channel (gray lines) and their mean ACF (black). The mean ACF allows the automatic derivation of the upper and lower frequencies surrounding the acoustic mode regime {f_{h}, f_{c} }. Inset panel: logarithm representation of the associated PSD estimates computed using Eq. (1). The PSD allows us to localize the flicker frequency {f_{g} }. The lower frequency {f_{ℓ}} is set by the window length of the SG filter applied to each oneday subseries to remove the longterm variability. In both panels,the splitting frequencies are indicated by the dotted vertical red lines. The blue dotted line represents the slope of the PSD (see Eq. (2)) measured in the frequency region associated with the granulation regime ν ∈ [f_{g}, f_{c}]. From high to low frequency, the distinct regions are denoted by letters A to D. 
Fig. 7
Evolution of the inferred power index estimated on averaged periodograms of oneday solar subseries. From left to right: index evaluated for (a) the highfrequency region, (b) the pmodes region, (c) the granulation region, and (d) the lowest frequency region. Each color corresponds to an SPM channel. In region (c), the index obtained on the HMI solar observations discussed in Sect. 3.1 is shown by the horizontal black line (with the 1σ uncertainties in gray). This index results from the averaged periodogram computed using the 91 oneday subseries that have been selected during the period of a solar minimum (2017–2019). 
3 Impactof solar shorttimescale variability on inferred transit parameters
Before extracting the statistical properties of flicker noise from stars other than the Sun, we aim to evaluate its impact on the parameters inferred from exoplanet transit light curves. To this end, we generated artificial light curves of planetary transits based on resolved images of the solar disk during a solarcycle minimum. In this section, we describe the analysis of these light curves assuming purely white Gaussian noise and quantify the errors made on the inferred parameters when the correlation properties of flicker noise are not correctly taken into account.
3.1 Artificial transit light curves in HMI observations
In order to generate realistic planetary transit light curves, we selected observations^{7} taken by the Helioseismic and Magnetic Imager (HMI) instrument onboard the Solar Dynamics Observatory (SDO). Since 2010, HMI has been observing the photospheric Fe I absorption line at 617.3 nm almost continuously, producing one image of the solar disk every 45 s. The resolution of HMI is 0.505 arcsec pixel^{−1} and the optical resolution is 0.91 arcsec, corresponding roughly to 366 km on the solar surface at disk center (Schou et al. 2012).
We selected 91 different dates during a minimum of the solar activity cycle where no significant signatures of active structures (spots, plages) were observed on the visible part of the solar surface. For each date, we downloaded images spanning one day and extracted the solar flux from each of them by integrating the intensity over all pixels.
To create artificial transit light curves of exoplanets in each oneday solar dataset, we superimposed a black sphere on the images, and moved it across the disk, mimicking a transiting planet. We assume exoplanet sizes of R_{p} = 1, 3, 5, 7, and 10 Earth radii (R_{⊕}). For each planet size, we simulated transits with impact parameters b = 0, 0.2, 0.4, 0.6 and 0.8. This led to 25 artificial transit light curves for each of the 91 solar time series. One of these, corresponding to R_{p} = 5 R_{⊕} and b = 0, is shown in the top panel of Fig. 8 (see gray line).
This way of generating artificial transit light curves has three effects that are extraneous to real exoplanet transits. First, since the SDO spacecraft is on an inclined geosynchronous orbit around Earth, the apparent size of the Sun on the CCD images changes with time. Second, as the size of the planet is scaled in terms of pixels, the planettosun radius ratio (p = R_{p}∕R_{⊙}) slightly changes over time. To increase the accuracy of p, which is needed to assume that the radius ratio is constant over time, we oversampled the artificial exoplanet by a factor of five. This means that on one pixel of the HMI image there are 5 × 5 pixels in the exoplanet mask^{8}. The exoplanet mask was then derived in the highresolution oversampled regime, and interpolated back to the original image resolution under conservation of flux within each pixel. This leads to partial pixel eclipses at the boundary of the exoplanet, and consequently to a greatly enhanced accuracy. In Appendix A, we show the intrinsic variation of the transit depth (p^{2}) over time, which is <0.02% of the true value, leading to relative errors between the transit models and the transits resulting from our experiment below 2 ppm. Finally, with this experiment, we assume discrete exoplanet positions for each HMI image, neglecting the movement of the planet during HMI’s 45s exposures. While the effect of exposure time is known to affect the transit parameters (Kipping 2010), we assume our temporal cadence to be sufficiently small to do not impact our resulting light curves.
The effect induced by the orbit of the recording satellite (period ~ one day) is visible in Fig. 8 as a longterm quasisinusoidal variation. From the analysis based on VIRGO data (see Sect. 2), we know that granulation noise dominates periods of <30 min. To correct the light curves from the effect of the satellite motion, we chose to filter each raw (i.e., without transits) solar time series with a smooth SG filter that has a passband larger than ten times the characteristic period of granulation (i.e., >5 h). An example of the final residuals resulting from this data filtering is shown in the bottom panel of Fig. 8. The corrected transit light curves are shown in Fig. 9 (see also Fig. 10 for a better display of a central transit of an Earthsized planet). We note that we also tried to perform this correction using GP, but to avoid any influence of the GP on the flicker noise, we chose to use simple smooth functions.
Finally, the HMI observations are given without error bars, which are necessary to derive the transit parameters (see Sect. 3.2) together with their uncertainties. We estimated the errors on the individual data points from the residual scatter of the transitfree light curves after correction of the variation due to the satellite motion. On average, we obtained σ = 20−30 ppm (see Appendix. B). The whole set of artificial transit light curves is publicly available online^{9}.
Fig. 8
Top: example of a raw solar light curve extracted from HMI images (black) and the corresponding smooth detrending function based on SG filters with a window size of 5 h (red). An artificial transit of a 5 R_{⊕} planet crossing the center of the solar disk (b = 0) is shown in gray. Bottom: residuals of the raw solar light curve after correcting by the SDO satellite motion. 
3.2 Impact of flicker on the inferred transit parameters
To retrieve the transit parameters, we used stateoftheart transit modeling based on the Mandel & Agol (2002) algorithm. For each artificial transit light curve associated to a given set of solar observations, we performed a Markov chain Monte Carlo (MCMC) analysis following the scheme described in Lendl et al. (2017, 2020) and using the differentialevolution MCMC engine developed by Cubillos et al. (2017).
We assumed the planet orbital period (1 yr) was known and fitted for the planettostar radius ratio (R_{p} ∕R_{s}), the epoch of midtransit (T_{0}), the impact parameter (b), the transit duration (t_{d}), and the quadratic limb darkening coefficients (u_{1}, u_{2}). We applied uniform priors to each of these parameters. The results are discussed in the following.
We found relative errors on R_{p}∕R_{s} increasing with decreasing planet size, which is expected as larger planets create deeper transits while the noise level remains similar. Absolute percentage errors (difference between the peak of the MCMC posterior and the true value normalized by the true value) on R_{p} ∕R_{s} appear small (<1–2% for planets with sizes above 3 R_{⊕}), but can be large (~10%) for Earthsized planets (see Fig. 11). For comparison, we generated synthetic light curves containing only WGN and derived the errors on R_{p}∕R_{s} using the same MCMC approach. To generate these synthetic WGN time series, we isolated the highfrequency noise present in the data by (i) removing the true transit model from our time series, (ii) applying a SG filter to filter out correlated noise at timescales above 15 min, and (iii) generating atransit light curve with the Mandel & Agol (2002) analytical model. For all involved transit parameters, we found errors on R_{p}∕R_{s} ≪ 1% (see red histograms in Fig. 11).
We now focus on the impact of the solar flicker noise on the inferred planettostar radius ratios. Figure 12 (left panel) shows that for Earthsized planets, the true values are not within the 1σ uncertainties for 49% of the cases when b = 0 and for 81% when b = 0.8. Moreover, the true value is not even included within the 3σ uncertainties for 11% of the realizations when b = 0 and for 4.5% when b = 0.8. For the largest exoplanets (10 R_{⊕}, see right panel), while the percentage error is very small (<1%), none of our inferred values contain the true radius ratio within their 1 and 3σ uncertainties when b = 0. This offset is not observed when the planet crosses the solar limb (b = 0.8). This effect is likely due to uncertainties on the limb darkening parameters that can bias the retrieved transit parameters (Espinoza & Jordán 2016). In line with this result, Cubillos et al. (2017) showed that MCMC analyses that ignore timecorrelated noise produce inaccurate transitdepth estimates and can largely underestimate their uncertainties. Both effects increase as the variance of the correlated noise increases.
For the transit duration, we found a distribution of the relative errors centered around 0%, and with a dispersion up to 2% for the Earthsized planet (see first row of Fig. 13). For this planet, we measure a dispersion around the true value (t_{d} = 13.09 hr) of ± 31.3 min for b = 0 (i.e., error up to 4%) and of ± 55.4 min (i.e., error up to 10%) for b = 0.8 (with the true value t_{d} = 7.97 hr).
For the time of midtransit, T_{0}, we found nooffset but a dispersion of the inferred parameters around the true value that is also quite large for the Earthlike planet: ± 19.2 min for b = 0 and ± 15.4 min for b = 0.8 (see second row of Fig. 13). Moreover (not shown here), the true T_{0} was not contained inside the 1σ uncertainties for more than 76% of the cases when b = 0 (28% fell outside the 3σ uncertainties) and 77% for b = 0.8 (25% fell outside the 3σ uncertainties). These errors can directly affect the measurement of transit timing variations in multiplanet systems.
Finally, for the impact parameter, the difference between the true and inferred value decreases with the size of the planet (see last row of Fig. 13). For the Earthsized planet, the distribution is almost uniform making the inferred impact parameter essentially unconstrained by the observations.
These results suggest that previous analyses of single transit events performed on light curves dominated by stellar noise (e.g., for bright solarlike Kepler targets, see Gilliland et al. 2011, 2015) may be miscalculated by a few percent due to a misunderstanding of the correlation structure of the shorttimescale stellar noise. All these errors can directly impact the characterization of the exoplanet. As a consequence, they can impact models of the interior structure of planets (see e.g., Dorn et al. 2015)and atmosphere. The present experiment demonstrates the need for new statistical tools dedicated to correctly accounting for the effect of flicker noise, and strategies to mitigate its effect on the derived transit parameters. These new tools will need to encapsulate the statistical properties of flicker that have been derived in this paper (see Sect. 2 for the Sun and the Sect. 4 for other main sequence stars).
Some studies have already focused on identifying the component of flicker noise in photometric data and modeling it. Recently, Morris et al. (2020) performed a similar study, also using HMI data to reproduce exoplanet transits. This study conceptually differs from ours: they used a single HMI image to estimate the flicker variability amplitudes. As a consequence, they underestimate the flicker noise within the resulting light curves. Then, to probe the effect of shorttimescale stellar variability on transits, they injected model transit light curves into transit free data. The strength of our study lies in the fact that we extract the effect of this variability on planetary transits without any prior assumptions (e.g., limb darkening) of the transit light curve shape, and at the same time we also account for variability introduced by the changing photospheric flux obscured by the planet.
Using 3D radiative hydrodynamical simulations of solar convection, Chiavassa et al. (2015) succeeded to evaluate the contributionof flicker on the transit light curve of Venus (which appeared in 2004). Moreover, these 3D simulations allow us to derive very accurate limbdarkening laws (Chiavassa et al. 2015), which is another critical aspect we must consider for deriving unbiased transit parameters as described above.
Other studies proposed to use GP to model the shorttimescale variability noise sources (including flicker, oscillations pmodes, and highfrequency noise; see e.g., Barclay et al. 2015; Pereira et al. 2019). Compared to analyses based on WGN models (as in this section), modeling the flicker noise with GP models increases the parameter uncertainties but could allow us to improve the accuracy on the transit parameters. However, this modeling approach alone does not improve the precision of the transit parameters. A noteworthy advantage of the GP approach lies in its capacity to constrain the stellar noise properties. For example, Pereira et al. (2019) found that their GP regression is able to derive accurate values of the pmode mean oscillation frequency ν_{max}. Following this same idea of linking the properties of the flicker noise with the stellar parameters, we now investigate the correlations associated to flicker using Kepler observations for a range of very bright stars on (or near) the main sequence.
Fig. 9
Example of artificial exoplanet transit light curves generated using solar HMI observations (quiet Sun, 20181210). Each panel shows five transits for planets with sizes ranging from 1 to 10 R_{⊕} (see legend). Panels from left to right: different orbit configurations (see the panels’ header). 
Fig. 10
Top: example of artificial transit of an Earthsized planet crossing the disk center of the Sun (b = 0, black). The transit model with the true input parameters is shown in green and the model computed using the inferred parameters in red. The error on R_{p} ∕R_{s} is around 2% in this example. Bottom: residuals based on the inferred transit model. 
Fig. 11
Distribution of the absolute percentage error on the planettostar radius ratio inferred from our simulated transits for planets with sizes R_{p} = [1, 3, 5] R_{⊕} (left to right) and impact parameters b = 0 (top) and b = 0.8 (bottom). Inferred errors on radius ratios derived from light curves containing only WGN are shown in red. 
Fig. 12
Planettostar radius ratio and 1σ uncertainties inferred from the MCMC analyses performed on the artificial light curves of exoplanets of size R_{p} = 1 R_{⊕} (left) and R_{p} = 10 R_{⊕} (right). The case of b = 0 is shown in black and b = 0.8 in red. The true radius ratios are indicated by the horizontal dotted lines. 
Fig. 13
From top to bottom: distribution of the absolute percentage errors on t_{d}, T_{0}, and b for the whole set of artificial transit light curves. 
4 Shorttimescale stellar variability on Kepler stars
In this section, we aim to extract the granulation properties using Kepler observations of bright stars. The flicker amplitude is already known to be related to the stellar parameters (see e.g., Bastien et al. 2016, based on F8 measurements). Therefore, we focus here on the relation between the flicker characteristics timescales and the stellar fundamental parameters.
4.1 Kepler shortcadence observations
The Kepler prime mission was operating from 2009 to 2013 (Borucki et al. 2010). It operated in the optical wavelength range λ ∈ [400, 865] nm: a much broader passband than that of the solar observatories VIRGO and HMI (see Sects. 2 and 3). To compare Kepler images with solar observations from VIRGO, previous studies often used a sum of the red and green channels as these are the closest to the Kepler passband (see e.g., Basri et al. 2013; Salabert et al. 2017). Kepler longcadence observations (29.4 min) have been intensively studied to derive the longtimescale stellar variability for stars of different stellar types (Basri et al. 2013; McQuillan et al. 2012) as well as the shorttimescale variability evolving periods of less than one day (see e.g., Mathur et al. 2011; Cranmer et al. 2014; Bastien et al. 2016; Pande et al. 2018). For the latter, it has been shown that the granulation amplitude (F8 measurements) can be linked to the stellar surface gravity. The same can be applied for the turnover granulation timescales (i.e., the period corresponding to f_{c}, Kallinger et al. 2016).
In this section, we focus on the shortcadence (SC, 58.8 s) Kepler observations that have been performed on a small number of stars, as longcadence observations do not carry information about noise correlations at timescales below 30 min, where most of the granulation signal is located. We focus here on the determination of the correlation properties that we defined in Sect. 2.4 through a flicker power index α_{g} (measured as the slope of the PSD in the frequency range between the corner and flicker frequencies, f_{c} and f_{g}, respectively).
For this purpose, we selected the brightest stars observed by Kepler in SC mode (apparent magnitude m_{v} < 11.5), for which no planet has been detected (a total of 3970 objects). From this sample, we removed binary stars, rotationally variable stars, stars in clusters, peculiar stars, and red giants (ending up with 1401 objects). At this point, our sample contained mainly G and F stars as latetype stars (K to M) were rejected by the magnitude cutoff. For each star from our sample, we downloaded the whole set of SC observations and detrended the light curves following a similar procedure as for the VIRGO time series (see Sect. 2). As the Kepler spacecraft rotated by 90 degrees every 90 days (to keep the solar panels in the direction of the Sun), the observations of a given star are divided into four subseries (called “quarters”) per year. For each target and quarter, we carry out the following procedure: (1) we remove the data points affected by spacecraft safemode events and corrected for the background; (2) we smooth the time series using a running average of 3 days length and localize the 3σ outliers; (3)excluding these outliers, we bin the resulting time series into intervals of 24 h, apply a spline function to this binned series, and use this function to normalize the initial time series (containing the outliers of step 2.); and (4) finally we remove the 5σ outliers from the time series.
Following this procedure we combined all observations of the same target from all quarters and split the final detrended time series into onedaysubseries. For each oneday subseries, we filtered out the lowfrequency noise associated with magnetic activity using a SG filter with a 15 h passband.
For all targets, the photometric contribution of granulation to the high frequencies (HF) cannot be observed (contrary to VIRGO observations; see region ν < 5000 μHz in Fig. 2). Indeed, the HF noise in these Kepler data is nonnegligible with an amplitude comparable to granulation noise.
Fig. 14
Example of an averaged periodogram of the Kepler target KIC 7940546 (M_{s} = 1.152 M_{⊙}, R_{s} = 1.807 R_{⊙}, T_{eff} = 6244 K, m_{v} = 7.397). The number of available oneday subseries for this target is L = 75. Cutoff frequencies f_{c} and f_{g} resulting from the MCMC analysis are indicated by the vertical dashed lines. 
4.2 Frequency domain analyses
For each target, we selected the whole number of available oneday subseries ensuring a minimum number of data points per series. The required condition for a subseries to be considered is that it must have no more than 10% missing data, with a maximum length for a single gap of five consecutive data points. Using these subseries, we then generated the corresponding averaged periodogram using Eq. (1) for each target. An example of one of these periodograms for an F6IV star isshown in Fig. 14.
As in Sect. 2, we aim to describe these stellar PSDs with power law functions defined as in Eq. (2). For this purpose, the target PSDs have to be split into four regions that are: the highfrequency region (A) delimited by f_{h}, the pmode region (B) delimited by f_{h} and f_{c}, the flicker region (C) delimited by f_{c} and f_{g}, and the lowfrequency region (D) delimited by f_{g} and the frequency cutoff of the SG filter, f_{ℓ}. These regions are labelled in Figs. 5 and 14. To improve the automatic identification of these cutoff frequencies, and because the Kepler observations are noisier than solar VIRGO observations, we made use of a priori knowledge on the mean oscillation frequency (ν_{max}) to help the localisation of the oscillation pmodes (which are not clearly visible in all Kepler periodograms). Following Brown et al. (1991), Kjeldsen & Bedding (1995), and Belkacem et al. (2011), we estimated ν_{max} based on the stellar parameters as: (4)
with ν_{max,⊙} = 3150 μHz being the solar value. Moreover, we fixed frequency f_{h} to the solar VIRGO value (i.e., 4000.15 μHz, see Sect. 2) as we found this value to be a good estimate for all considered stars with detectable acoustic modes signatures (mostly G and F stars). For each star, we then inferred the parameters of Eq. (2) and the cutoff frequencies using MCMC analyses. For each MCMC, the fitted (“jump”) parameters are the two cutoff frequencies f_{c} and f_{g}, the three associated power indices , and constants as defined in Eq. (2). The starting values were set to the solar values obtained in Sect. 2 and no other prior than ν_{max} was used to avoid influencing the results.
For approximately twothirds of the targets in our sample, we found a flicker index α_{g} < 0.1 with large uncertainties, which means that the PSD slope in the frequency range of granulation is not clearly detected. As granulation signals are undetectable for these targets, we do not consider them in the following analysis. Our final set of targets contains 335 stars, among which we selected the 82 “best” targets, as these are bright (m_{v} < 10, see Figs. 15), which makes the measurement of the granulation parameters (f_{c}, f_{g} and α_{g}) more precise. The values for f_{g} and α_{g} resulting from the MCMC analyses of our 335 targets are shown as a function of the stars’ fundamental parameters^{10} in Figs. 16 and 17 (top panel).
In both figures, the subsample of best targets is highlighted. Moreover, we quantified the strength of any potential correlation between the fitted parameters (f_{g} and α_{g}) and the stellar parameters using the Pearson (ρ_{P}) and Spearman’s (ρ_{S}) coefficients (see numerical values in each panel).
As shown in Fig. 16, we observe strong correlations (i.e., ρ_{P} , ρ_{S} > 0.2) between the flicker frequency f_{g} and the stellar mass, radius, and surface gravity. These correlations are particularly noteworthy when placing the solar f_{g} value derived from VIRGO data on these plots (shown as star symbols). As expected, the granulation timescales decrease, that is, the flicker frequencies increase, for decreasing stellar mass and radius. This is in agreement with the work of Mathur et al. (2011), who extracted the characteristic timescales of granulation based on different Harvey law fits on Kepler red giant stars. We note that no significant correlation is observed with the stellar magnitude (ρ_{P}, ρ_{S}~ 0.1), which means that the flicker frequency can be derived independently of the HF noise level for all targets for which granulation as a whole is detectable (i.e., α_{g} > 0.1).
In Fig. 18, we also show the characteristic frequencies f_{c} and f_{g} as a function of the oscillation frequency ν_{max} that has been derived using Eq. (4). We observe clear linear correlations between these three stellar characteristic frequencies. This indicates that the typical granulation timescales, combined with the mean frequency of the acoustic modes, may be able to track stellar characteristics, such as the stellar surface gravity (Kallinger et al. 2016).
In Fig. 17 (top panel), we display the inferred power index α_{g} resulting from the MCMC analyses as a function of the stellar parameters. As for parameter f_{g}, we observe strong correlations between this parameter and stellar mass, radius, surface gravity, and acoustic oscillation frequency ν_{max} (see last column). However, we also observe a significant correlation with stellar apparent magnitude. This indicates that the inferred flicker indices are influenced by the high level of HF noise, which biases the correlations seen with the stellar parameters. This is particularly evident when comparing the inferred α_{g} with the flicker index extracted from VIRGO solar observations (which have very low HF noise σ_{W}~ 5 ppm, shownwith stars symbols). To compare the solar flicker index with Kepler observations, and to coherently interpret Kepler data, we need to find a way to combine flicker indices computed from PSDs with different white noise levels. For this purpose, we added different levels of synthetic WGN with variance to the solar data and computed the flicker indices of these data as done for the Kepler observations. Figure 19 shows the fast decrease of the solar flicker index with increasing HF noise (σ_{W}). The black dots represent the flicker index measured on the averaged periodogram of several Sunlike stars observed by Kepler as a function of the level of HF noise (see list in Table C.1). These values illustrate that correlated noise due to granulation becomes more difficult to detect with increasing HF noise. Uncertainties on α_{g} are directly related to the number of oneday subseries available to compute the averaged periodogram. When correcting the measured solar flicker index in Fig. 17 (star symbols) for the effect of additional WGN corresponding to that present in Kepler observations, we found a solar flicker index in better agreement with the correlations observed between the Kepler starsand the stellar parameters (see square symbols).
However, to derive the correct relation between the flicker index and the stellar parameters, we have to account for different levels of HF noise when comparing the derived flicker indices. This is the objective of the following section.
Fig. 15
Selected Kepler targets represented as a function of their mass and radius. The color code indicates the inferred index parameters in the frequency region of granulation (α_{g}) derived aftercorrection by the HF noise. The best targets with R^{2} > 0.9 are shown in color (see Sect. 4.3). 
Fig. 16
Estimated values of the cutoff flicker frequency f_{g} as a function of stellar parameters resulting from the MCMC analyses performed on periodograms of the selected SC Kepler targets. From left to right: stellar mass, radius, effective temperature, surface gravity, and apparent magnitude. Black dots represent targets with magnitude m_{v} ≤ 10 and gray dots targets with magnitude m_{v} > 10. The star symbol shows the solar cutoff frequency evaluated from VIRGO data (see Sect. 2.4). Pearson (ρ_{P}) and Spearman’s (ρ_{S}) coefficients, evaluated for stars with m_{v} ≤ 10, are indicated in each panel. 
Fig. 17
Estimated flicker index associated to granulation (α_{g}) as a function of stellar parameters. From left to right: stellar mass, radius, effective temperature, surface gravity, apparent magnitude, and normalized ν_{max} resulting from (4). Top: values obtained from the MCMC analyses. The color code indicates the apparent magnitude of the target: m_{v} < 10 (black) and m_{v} > 10 (gray). The star symbol represents the index derived for the Sun based on VIRGO green channel observations: α_{g} = 1.26 with σ_{W} = 5 ppm. The square symbol represents the index obtained after adding a HF noise level (corresponding to that seen in Kepler observations of the Sunlike star KIC 3427720) to the VIRGO subseries. Bottom: values obtained after interpolation that corresponds to a HF level of σ_{W} = 30 ppm (see Sect. 4.3). The color code indicates here the data with R^{2} > 0.8 (i.e., the best fits, black) and R^{2} > 0.5 (blue). The triangle symbol represents the value of the flicker index derived by adding a WGN of σ_{W} = 30 ppm in solar VIRGO observations, for which we find: α_{g} = 0.9464. This is the raw level of HF noise we expect for Sunlike stars observed with CHEOPS. Pearson (ρ_{P}) and Spearman’s (ρ_{S}) coefficients associated with these plots for the best targets (R^{2} > 0.9) are indicated on each panel. 
Fig. 18
Correlations between ν_{max} (determined using Eq. (4)), the corner frequency, and the flicker frequency. The corner and flicker frequencies have been derived for each Kepler target using the MCMC analysis described in Sect. 4.2. The color code indicates the apparent magnitude of the target: m_{v} < 10 (black) and m_{v} > 10 (gray). Solar values derived from VIRGO observations are shown by the yellow star in each panel. Pearson (ρ_{P}) and Spearman’s (ρ_{S}) coefficients associated with these plots are indicated on each panel. 
Fig. 19
Estimated values of the flicker index as a function of the HF noise level added in the VIRGO time series (red, blue and green SPM channels). Symbols show the power index measured on the PSD of Kepler Sunlike stars listed in Table C.1. 
4.3 Flicker index derived at a constant HF noise level
To correct for the influence of the HF noise level on the flicker index and derive unbiased correlations between α_{g} and the stellar parameters, we choose to rely on interpolation techniques.
For each Kepler target, we first empirically measured the decrease of the power index as a function of added HF noise (σ_{W}). We proceed in the same way as for VIRGO solar observations (see Sect. 4.2 and Fig. 19). For each considered value σ_{W}, we added a synthetic WGN to the available oneday subseries, computed the averaged periodogram, and derived the flicker index associated to the frequency region ν ∈ [f_{g}, f_{c}], with f_{g} and f_{c} the flicker cutoff frequencies of the star in question (see Sect. 4.2). An example is shown in Fig. 20 for a bright Fstar (black dots). For this particular target, we measured an initial HF noise level of σ_{W} = 69 ppm that corresponds to α_{g} = 1.23 (red dot). We then used the measured decrease to extrapolate the flicker index towards smaller values, using an exponentially decreasing function of the form: (5)
with parameters to be fitted. We performed a leastsquare regression of our empirical curve α_{g} (σ_{W}) to derive the coefficients {a, b, c} (gray line in Fig. 20). For this example target, we found a corrected power index of α_{g} = 2.1 at the HF noise level of the solar VIRGO observations (i.e., σ_{W} = 5 ppm).
We performed similar leastsquare regressions for all targets in our sample. As expected, the quality of our interpolations depends on the initial level of HF noise that is present within the data: the higher the HF noise, the more inaccurate the interpolated flicker index at low σ_{W}. To measure the goodnessoffit, we used the coefficient of determination (also known as rsquared coefficient R^{2}), that gives an idea of the distance between the best fit and the observed data points (Heinisch 1962).
We disregarded all targets with a corrected power index with R^{2} < 0.5, leaving us with 245 Kepler targets with R^{2} > 0.5, among which 118 have R^{2} > 0.8. The corresponding parameters {a, b, c} associated with model Eq. (5) for these remaining targets are shown as a function of the stellar parameters in Fig. 21. We observe strong correlations between these parameters and the stellar parameters. The combination of these correlations with Eq. (5) allows us to estimate the flicker index we will observe for a target star observed with a given level of HF noise and make some predictions for future highprecision observations of CHEOPS and PLATO (see Sect. 5).
We then chose a reference level of σ_{W} = 30 ppm, because this is close to the HF noise level expected to be reached with CHEOPS for Sunlike stars with a magnitude m_{v} < 8 (see following Sect. 5). The corrected indices (interpolated at the level of σ_{W} = 30 ppm) are shown as a function of the stellar parameters in the bottom panel of Fig. 17. Comparing with the raw flicker indices of the top panels, we observe more significant correlations with the stellar parameters, as expected. The Pearson and Spearman’s coefficients reveal strong positive correlations with the stellar mass and radius and negative correlation with the surface gravity, in particular when considering only the best targets (R^{2} > 0.9, black dots). If we include the whole Kepler sample (i.e., 0.5 < R_{2} < 1, gray dots), the correlations are slightly less pronounced as the flicker indices show a larger dispersion. We expect these relations to become increasingly precise with future highprecision observations of CHEOPS and PLATO.
Fig. 20
Flicker index α_{g} as a function of the level of HF noise (black) for the Fstar KIC 7940546. The first value, computedfrom raw Kepler observations, is indicated by the red dot and corresponds to α_{g} = 1.23 and σ_{W} = 69 ppm. The gray line shows the interpolated function (see Eq. (5)), for which we obtained a quality factor of R^{2} = 0.96. The interpolated index at σ_{W} = 30 ppm is α_{g} = 1.69 (blue triangle). 
5 Predictions for CHEOPS and PLATO
CHEOPS isthe first ESA Sclass mission. Its objective is to characterize transiting extrasolar planets with highprecision photometric observations (Fortier et al. 2014). The passband (λ ∈ [400, 1100] nm) and high cadence (1 min) of CHEOPS will be similar to the Kepler SC observations. However, CHEOPS will mainly focus on bright stars making this instrument a very promising tool to characterize the stellar variability affecting highprecision observations. For example, Moya et al. (2018) recently analyzed the detectability of the oscillation frequency ν_{max} on main sequence bright stars that will be observed with CHEOPS. These latter authors found that ν_{max} will be detectable on most main sequence stars, which can help to precisely constrain the age, mass, radius, and density of the host stars, also aiding the characterization of the observed transiting planets.
In this section, we explore to what extent the upcoming missions CHEOPS and PLATO will allow us to characterize stellar granulation through the power index defined in Sect. 2. We consider flicker noise to be detectable in light curves with inferred power indices of α_{g} > 0.2, while HF noise dominates otherwise. Our objective is to derive the limiting magnitudes (m_{v,lim}) for which our measurements possess the necessary precision to measure at least this limiting flicker index α_{g,lim}.
In Sect. 4, we showed the dependence of this index on the level of HF noise, which is related to the stellar apparent magnitude of the target star. We also derived the relation between the flicker indexand the level of HF noise through Eq. (5), which involves a set of parameters {a, b, c}. These parameters are correlated with the stellar mass (M_{s}) and radius (R_{s}, see Fig. 21).
In the following, we consider main sequence G and Ftype stars (as well as slightly evolved Fstars) that are known to host a convective envelop. Our set of stellar parameters {M_{s}, R_{s}} encapsulates:

stars with 0.8 M_{⊙}≤ M_{s} < 1.4 M_{⊙} and 0.8 R_{⊙}≤ R_{s} ≤ 2.5 R_{⊙},

stars with 1.4 M_{⊙}≤ M_{s} ≤ 1.5 M_{⊙} and 1.7 R_{⊙}≤ R_{s} ≤ 2.5 R_{⊙},

stars with M_{s} = 1.6 M_{⊙} that have a shallow convective envelope, though still present, at R_{s} = 2.1, 2.2 and 2.3 R_{⊙}.
We do not include stars of M and K spectral types with M_{s} < 0.8 M_{⊙} because the flicker indices of such stars were not constrained by our sample of Kepler observations (see Sect. 4). Moreover, according to stellar models based on the Code Liégeois d’Evolution Stellaire (CLES) stellar evolution code (Scuflaire et al. 2008; Fernandes et al. 2019), no sufficiently thick convective envelopes are expected for stars with masses above 1.6 M_{⊙}.
For each set of parameters θ_{s} := {M_{s}, R_{s}}, we first predicted the values of parameters {a, b, c} as defined in Eq. (5) using linear and quadratic functions of the stellar mass and radius. We found the following relations (see red lines in see Fig. 21): (6)
We then derived the HF noise level (σ_{W,up}) corresponding to the limiting flicker index α_{g,lim} using Eq. (5) with parameters {a, b, c} given by Eq. (6). For each target in our defined {M_{s}, R_{s}} grid, we came up with the highest HF noise level, σ_{W,up}, that is acceptable to observe a flicker index α_{g} ≥ α_{g,lim}. We then derived the limiting magnitudes m_{v,lim} corresponding to σ_{W,up}. To do so, we computed the expected precision for CHEOPS under the assumption that the targets will be observed during times when stray light from the Earth is no more than 0.62 phot s^{1} pix^{1}, a medium value found from simulation, accounting for all other noise sources as done in the mission’s Exposure Time Calculator^{11}. We used a 5920 K blackbody SED (similar to that of a G0 star, typical for our sample), and a time window of one hour.
However, it is important to note that the number of oneday subseries (L) also plays arole as the variance of the averaged periodogram at a given frequency ν decreases with L (and so do the uncertainties on α_{g}).
The limiting magnitudes (i.e., those corresponding to σ_{W,up}) derived for the grid of {M_{s}, R_{s}} are shown in the left panel of Fig. 22. For solarlike stars, flicker noise will become relevant for magnitudes brighter than m_{v} ≤ 10. This encapsulates a large fraction of the expected targets observed with CHEOPS. However, for slightly evolved Fstars, the limiting magnitude will be around 13 and therefore flicker noise is expected to be relevant in most if not all CHEOPS light curves of bright Ftype stars. For these stars, precise flicker index values should be measurable.
To make predictions for the PLATO mission (Rauer et al. 2014), we use the precision estimate by MarcosArenal et al. (2014), who (when using 24 cameras) quote an expected noise level of 27, 34, and 80 ppm per hour for stars with m_{v} = 10.8, 11.3 and 13, respectively. We show the limiting stellar magnitudes expected from future PLATO observations in the right panel of Fig. 22. We predict a higher impact of flicker noise in PLATO light curves, and expect that the characterization of the flicker properties (amplitudes and timescales) should be well feasible for most F and G stars.
We see through the analysis of solar observations (see Sect. 3) that flicker variability can lead to significant errors on the inferred transit parameters of the smallest planets in the case of a single (or a small number of) observed transit(s). The development of accurate noise modeling procedures will not only allow us to decrease the errors on the transit parameters, but also to strengthen our understanding of the underlying link between the noise properties and stellar physics (see Sect. 4).
Fig. 21
Coefficients {a, b, c} involved in Eq. (5) as a function of the Kepler stellar parameters. Red curves represent the linear and quadratic functions described in Eq. (6). 
Fig. 22
Illustration of the limiting stellar apparent magnitude, depending on the stellar parameters, (radius and mass) that is needed to measure a flicker index with α_{g} > 0.2 with the future CHEOPS (left) and PLATO (right) highprecision observations. 
6 Conclusions
We present a statistical characterization of the shorttimescale stellar variability associated (mainly) with granulation noise. Based on solar observations, we find this noise source to be: (i) stochastic, (ii) colored, (iii) stationary with respect to the solar cycle, and (iv) wavelength dependent. It can generate variability of several hundred parts per million in amplitude. In the relevant frequency region of the PSD, the introduced correlations can be modeled by a simple power law. We chose to use the power index resulting from fits to the PSD as an indicator of the noise correlations. We used HMI images of the Sun to create artificial transit light curves of hypothetical planets transiting the Sun, and analyzed the impact of this flicker noise on the inferred transit parameters. We showed that flicker noise is critical for the smallest planets, for which we find that the inferred parameters can be substantially offset from their true values. This however is likely due to inaccurate limbdarkening parameters which have been shown to introduce biases of the same magnitude by Espinoza & Jordán 2016.
We then turned to Kepler shortcadence observations to extract the dependence of the flicker power index on the stellar parameters. We found the inferred power index values to be heavily affected by the level of highfrequency noise (which is related to the stars’ apparent magnitude). Correcting for this influence, we observe a strong correlation between the corrected indices and the stellar radius, mass, and surface gravity. No clear correlation is observed with stellar effective temperature. These correlations confirm the already known relation between this stellar variability and the stellar properties (e.g., see Bastien et al. 2016).
Using this interpolated power index and the observed dependence with the stellar parameters, we predicted the limiting stellar apparent magnitude for which flicker noise will be characterizable with future highprecision observations of CHEOPS and PLATO. We find that the signature of this noise will be observable for most of the CHEOPS and PLATO light curves of Solarlike targets. This study highlights the need to design robust signal processing routines adapted to the characteristics of flicker noise in order to reduce errors on the inferred parameters of small exoplanets, which will be the objective of forthcoming studies.
Acknowledgements
The authors would like to thank René Heller for his peer review containing very useful suggestions, as well as D. Mary for his constructive comments. S.S., M.L., L.F. and P.C. acknowledge support from the Austrian Research Promotion Agency (FFG) under project 859 724 “GRAPPA”. V. Van Grootel is a F.R.S.FNRS Research Associate. The VIRGO instrument onboard SoHO is a cooperative effort of scientists, engineers, and technicians, to whom we are indebted. SoHO is a project of international collaboration between ESA and NASA. This paper includes data collected by the Kepler mission. Funding for the Kepler mission is provided by the NASA Science Mission directorate.
Appendix A Validity of the artificial transit lightcurve modeling
The artificial transit light curve experiment is based on solar observations and is subject to unresolved phenomena compared to true extrasolar planet transit events: the change of the solar radius over time and the variability of the number of the covered pixels by the black body sphere mimicking the planetary transit. This leads to temporal variations of the amount of masked solar surface that can introduce errors when applying traditional light curve models, such as those by Mandel & Agol (2002).
To quantify the error on the transit depth, we computed the ratio of the sum of the number of pixels covered by the exoplanet to the sum of the number of pixels of the solar disk. Both sums evolve as a function of time. We then measured this ratio intransit (δ_{in}) and compared it to the true transit depth (δ), taken as the square of the planet radius over the solar radius. The distribution of the percentage error for each set (R_{p}, b) is shown in Fig. A.1.
We observe a percentage error that is <0.01% of δ for each set of parameters. This value is far smaller than the global error found for the inferred transit parameters due to the flicker noise (see Sect. 3.2). It leads to a difference between the transit model and the artificial transit without noise that is <1 ppm intransit and <2 ppm in the ingress and egress regions of the transits. We note slightly larger differences in the ingress and egress regions due to variation of the exact number of pixels covered by the planet as a function of time.
We conclude that our experiment generating artificial transit light curves in solar observations is reliable as the temporal variability of the size of the surface area covered by the planet is not significant. This approximation of a constant planettosun radius ratio is valid when oversampling the raw HMI observations by a factor of two (not shown) or more. In our experiment, we oversampled each pixel of the HMI images by a factor of five as we found this value to be a good compromise between computational cost and constant number of covered pixels over time.
Fig. A.1
Distributions of the percentage error on the transit depth (δ) shown for the artificial transits generated using one solar time series (20181210). Each panel represents the errors for a different planet size (R_{p} = 1, 3, 5, 7, 10 R_{⊕} from top to bottom, resp.) and impact parameter (see legend). The temporal evolution of the ratio of covered to uncovered numbers of pixels has been measured in transit (δ_{in}). 
Appendix B Synthetic error bars added to solar HMI observation
Errors are not given for HMI observations but are necessary information to run the MCMC analyses and derive the uncertainties on the inferred transit parameters. To add synthetic error bars on our artificial light curve dataset we turned to GP (Rasmussen & Williams 2005). The GP modeling aims to roughly correct for the correlated components of the flicker noise to extract the remaining whitened scatter noise. This is a flexible noise modeling commonly used in the exoplanet community to take into accountthe correlated stochastic noise within the observations. For that purpose, we use the George package developed by Ambikasaran et al. (2015).
We chose to parametrize the noise covariance matrix in the solar observations (without artificial transit) with a product of two kernels that roughly describe the noise correlation: a constant (γ) and the Matèrn 3∕2 kernel, the latter being known to be flexible regarding unexpected local behavior of the observations and have already been used for modeling the shorttimescale granulation noise (Giles et al. 2018). Explicitly, this kernel writes:
with ℓ being the kernel’s metric and x = t_{i} − t_{j} the data inputs associated to the ith and jth data points, respectively. The covariance matrix, K, is then:
with σ_{i} being the uncertainties of the observations at time i and δ_{ij} being the Kronecker delta.
We performed a GP regression following the method described in Gibson et al. (2012) with the fit quality determined by minimizing the negative loglikelihood function corresponding to our GP model:
with r being the data residuals and N the number of data points.
An example of a GP fit is shown in Fig. B.1. For each dataset, we measured the standard deviation (σ) of the data residuals. We obtained σ = 20−30 ppm depending on the considered solar time series. We used these error bars in the MCMC simulations described in Sect. 3.2.
Fig. B.1
Top: example of a raw solar dataset (black) and mean of the predictive distribution of the GP model (red). Bottom: residuals of the solar dataset corrected by the GP model. The standard deviation of these residuals is used as input synthetic errorbars in our artificial transit light curves. 
Appendix C List of Kepler Sunlike stars
Studied Kepler Sunlike stars and their inferred flicker index.
References
 Aigrain, S., Gilmore, G., Favata, F., & Carpano, S. 2003, ASP Conf. Ser., 294, 441 [NASA ADS] [Google Scholar]
 Aigrain, S., Favata, F., & Gilmore, G. 2004, A&A, 414, 1139 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
 Ambikasaran, S., ForemanMackey, D., Greengard, L., et al. 2015, IEEE Trans. Pattern Anal. Mach. Intell., 38, 252 [NASA ADS] [CrossRef] [Google Scholar]
 Barclay, T., Endl, M., Huber, D., et al. 2015, ApJ, 800, 46 [NASA ADS] [CrossRef] [Google Scholar]
 Bartlett, M. S. 1950, Biometrika, 37, 1 [CrossRef] [MathSciNet] [Google Scholar]
 Basri, G., Walkowicz, L. M., & Reiners, A. 2013, ApJ, 769, 37 [NASA ADS] [CrossRef] [Google Scholar]
 Bastien, F. A., Stassun, K. G., Basri, G., & Pepper, J. 2016, ApJ, 818, 43 [NASA ADS] [CrossRef] [Google Scholar]
 Belkacem, K., Goupil, M. J., Dupret, M. A., et al. 2011, A&A, 530, A142 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
 Borucki, W. J., Koch, D., Basri, G., et al. 2010, Science, 327, 977 [NASA ADS] [CrossRef] [PubMed] [Google Scholar]
 Brown, T. M., Gilliland, R. L., Noyes, R. W., & Ramsey, L. W. 1991, ApJ, 368, 599 [NASA ADS] [CrossRef] [Google Scholar]
 Chaplin, W. J., Bedding, T. R., Bonanno, A., et al. 2011, ApJ, 732, L5 [Google Scholar]
 Chiavassa, A., Pere, C., Faurobert, M., et al. 2015, A&A, 576, A13 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
 Chiavassa, A., Caldas, A., Selsis, F., et al. 2017, A&A, 597, A94 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
 Corsaro, E., Mathur, S., García, R. A., et al. 2017, A&A, 605, A3 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
 Cranmer, S. R., Bastien, F. A., Stassun, K. G., & Saar, S. H. 2014, ApJ, 781, 124 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
 Cubillos, P., Harrington, J., Loredo, T. J., et al. 2017, AJ, 153, 3 [NASA ADS] [CrossRef] [Google Scholar]
 Dorn, C., Khan, A., Heng, K., et al. 2015, A&A, 577, A83 [EDP Sciences] [Google Scholar]
 Dravins, D. 1988, IAU Symp., 132, 239 [NASA ADS] [Google Scholar]
 Espinoza, N., & Jordán, A. 2016, MNRAS, 457, 3573 [NASA ADS] [CrossRef] [Google Scholar]
 Fernandes, C. S., Van Grootel, V., Salmon, S. J. A. J., et al. 2019, ApJ, 879, 94 [NASA ADS] [CrossRef] [Google Scholar]
 Fortier, A., Beck, T., Benz, W., et al. 2014, Proc. SPIE, 9143, 91432J [Google Scholar]
 Fröhlich, C., Romero, J., Roth, H., et al. 1995, Sol. Phys., 162, 101 [NASA ADS] [CrossRef] [Google Scholar]
 Fröhlich, C., Andersen, B. N., Appourchaux, T., et al. 1997, Sol. Phys., 170, 1 [NASA ADS] [CrossRef] [Google Scholar]
 García, R. A., Salabert, D., Mathur, S., et al. 2013, J. Phys. Conf. Ser., 440, 012020 [NASA ADS] [CrossRef] [Google Scholar]
 Gibson, N. P., Aigrain, S., Roberts, S., et al. 2012, MNRAS, 419, 2683 [NASA ADS] [CrossRef] [Google Scholar]
 Giles, H. A. C., Osborn, H. P., BlancoCuaresma, S., et al. 2018, A&A, 615, L13 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
 Gilliland, R. L., Chaplin, W. J., Dunham, E. W., et al. 2011, ApJS, 197, 6 [NASA ADS] [CrossRef] [Google Scholar]
 Gilliland, R. L., Chaplin, W. J., Jenkins, J. M., Ramsey, L. W., & Smith, J. C. 2015, AJ, 150, 133 [NASA ADS] [CrossRef] [Google Scholar]
 Harvey, J. 1985, ESA SP, 235, 256 [Google Scholar]
 Harvey, J. W. 1988, IAU Symp., 123, 497 [NASA ADS] [Google Scholar]
 Heinisch, O. 1962, Biometrische Zeitschrift, 4, 207 [CrossRef] [Google Scholar]
 Jiménez, A., Roca Cortés, T., & JiménezReyes, S. J. 2002, Sol. Phys., 209, 247 [NASA ADS] [CrossRef] [Google Scholar]
 Jiménez, A., JiménezReyes, S. J., & García, R. A. 2005, ApJ, 623, 1215 [NASA ADS] [CrossRef] [Google Scholar]
 Kallinger, T., De Ridder, J., Hekker, S., et al. 2014, A&A, 570, A41 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
 Kallinger, T., Hekker, S., García, R. A., Huber, D., & Matthews, J. M. 2016, Sci. Adv., 2, e1500654 [Google Scholar]
 Karoff, C. 2012, MNRAS, 421, 3170 [NASA ADS] [CrossRef] [Google Scholar]
 Kipping, D. M. 2010, MNRAS, 408, 1758 [NASA ADS] [CrossRef] [Google Scholar]
 Kipping, D. M., Bastien, F. A., Stassun, K. G., et al. 2014, ApJ, 785, L32 [NASA ADS] [CrossRef] [Google Scholar]
 Kjeldsen, H., & Bedding, T. R. 1995, A&A, 293, 87 [NASA ADS] [Google Scholar]
 Lendl, M., Cubillos, P. E., Hagelberg, J., et al. 2017, A&A, 606, A18 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
 Lendl, M., Bouchy, F., Gill, S., et al. 2020, MNRAS, 492, 1761 [NASA ADS] [CrossRef] [Google Scholar]
 Li, T. 2014, Time Series with Mixed Spectra (Boca Raton, Florida: CRC Press) [Google Scholar]
 Mandel, K., & Agol, E. 2002, ApJ, 580, L171 [NASA ADS] [CrossRef] [Google Scholar]
 MarcosArenal, P., Zima, W., De Ridder, J., et al. 2014, A&A, 566, A92 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
 Mathur, S., Hekker, S., Trampedach, R., et al. 2011, ApJ, 741, 119 [NASA ADS] [CrossRef] [Google Scholar]
 McQuillan, A., Aigrain, S., & Roberts, S. 2012, A&A, 539, A137 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
 Meunier, N., Lagrange, A. M., Borgniet, S., & Rieutord, M. 2015, A&A, 583, A118 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
 Morris, B. M., Bobra, M. G., Agol, E., Lee, Y. J., & Hawley, S. L. 2020, MNRAS, 493, 5489 [Google Scholar]
 Moya, A., Barceló Forteza, S., Bonfanti, A., et al. 2018, A&A, 620, A203 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
 Muller, R., Hanslmeier, A., Utz, D., & Ichimoto, K. 2018, A&A, 616, A87 [Google Scholar]
 Nesis, A., Hammer, R., Roth, M., & Schleicher, H. 2002, A&A, 396, 1003 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
 Nordlund, Å., Stein, R. F., & Asplund, M. 2009, Liv. Rev. Sol. Phys., 6, 2 [Google Scholar]
 Oshagh, M. 2018, Asteroseismology and Exoplanets: Listening to the Stars and Searching for New Worlds (Berlin: Springer), 49, 239 [NASA ADS] [CrossRef] [Google Scholar]
 Pallé, P. L., et al. 1999, ASP Conf. Ser., 173, 297 [NASA ADS] [Google Scholar]
 Pande, D., Bedding, T. R., Huber, D., & Kjeldsen, H. 2018, MNRAS, 480, 467 [NASA ADS] [CrossRef] [Google Scholar]
 Pereira, F., Campante, T. L., Cunha, M. S., et al. 2019, MNRAS, 489, 5764 [NASA ADS] [CrossRef] [Google Scholar]
 Ploner, S. R. O., Solanki, S. K., & Gadun, A. S. 2000, A&A, 356, 1050 [NASA ADS] [Google Scholar]
 Rasmussen, C. E., & Williams, C. K. I. 2005, Gaussian Processes for Machine Learning (Adaptive Computation and Machine Learning) (Cambridge, MA: The MIT Press) [Google Scholar]
 Rauer, H., Catala, C., Aerts, C., et al. 2014, Exp. Astron., 38, 249 [NASA ADS] [CrossRef] [Google Scholar]
 Ricker, G. R., Winn, J. N., Vanderspek, R., et al. 2015, J. Astron. Teles. Instrum. Syst., 1, 014003 [Google Scholar]
 Salabert, D., Garcia, R. A., Palle, P. L., & Jimenez, A. 2011, J. Phys. Conf. Ser., 271, 012030 [CrossRef] [Google Scholar]
 Salabert, D., García, R. A., Jiménez, A., et al. 2017, A&A, 608, A87 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
 Savitzky, A., & Golay, M. J. E. 1964, Anal. Chem., 36, 1627 [NASA ADS] [CrossRef] [Google Scholar]
 Schou, J., Scherrer, P. H., Bush, R. I., et al. 2012, Sol. Phys., 275, 229 [Google Scholar]
 Scuflaire, R., Théado, S., Montalbán, J., et al. 2008, Ap&SS, 316, 83 [NASA ADS] [CrossRef] [Google Scholar]
 Seleznyov, A. D., Solanki, S. K., & Krivova, N. A. 2011, A&A, 532, A108 [NASA ADS] [CrossRef] [EDP Sciences] [Google Scholar]
 Shapiro, S. S., & Wilk, M. B. 1965, Biometrika, 52, 591 [Google Scholar]
 Simon, M. K. 2006, Probability Distributions Involving Gaussian Random Variables: A Handbook for Engineers, Scientists and Mathematicians (Berlin, Heidelberg: SpringerVerlag) [Google Scholar]
 Stein, R. F., & Nordlund, A. 1989, ApJ, 342, L95 [NASA ADS] [CrossRef] [PubMed] [Google Scholar]
 Svensson, F., & Ludwig, H.G. 2005, ESA SP, 560, 979 [NASA ADS] [Google Scholar]
 Title, A. M., Tarbell, T. D., Topka, K. P., et al. 1989, ApJ, 336, 475 [NASA ADS] [CrossRef] [Google Scholar]
The definition based on Eq. (1) differs slightly from Bartlett (1950), who proposed to split a given time series X(t) into L subseries X_{ℓ}(t).
Useful notes: “Gaussian” and “colored” are unrelated properties. A random process is Gaussian if it is normally distributed (i.e., Gaussian probability distribution function). If the process is white, the covariance matrix is the Identity matrix (and the PSD is constant); if it is colored, the covariance matrix is not Identity (and the PSD is not constant in frequency). (Secondorder) Stationarity means that secondorder statistics (i.e., the correlation structure) do not change in time.
We made use of the function “convolution.Box1DKernel” available from the Python package http://www.astropy.org
We note that the existence of intermediate scales of convective motions (<3 × 10^{7} m), known as mesogranulation, is controversial even for the Sun (Stein & Nordlund 1989; Ploner et al. 2000).
Taken in the Kepler_stellar17.csv.gz catalog, https://archive.stsci.edu/kepler/catalogs.html
Available from https://www.cosmos.esa.int/web/cheopsguestobserversprogramme/ao1
All Tables
All Figures
Fig. 1
Yearly RMS of the VIRGO time series for the blue, green, and red SPM channels (see colors). Solar minima at the beginning ofcycles 23 (from 1996 to 2008) and 24 (since 2009) are clearly visible. 

In the text 
Fig. 2
Averaged periodograms computed with Eq. (1) using all oneday subseries available during a minimum (2008, solid dashed lines) and a maximum (2003, solid solid lines) of the solar cycle. Each color indicates the observation of one SPM channel. Differences with the solar cycle are observed at frequencies ν < f_{l} (i.e., at periods >6.2 h). The peak at ν = 5570 μHz affecting the PSD of the blue and green SPM data is an electronic artifact related to the calibration period used by the acquisition system of VIRGO. The gray shaded area indicates the frequency range corresponding to the typical duration of planetary transit (from ~ 25 min to several hours). 

In the text 
Fig. 3
Example of a oneday VIRGO time series (top) detrended (bottom) using a SG filter with a window size of 15 h (colored solid lines). From left to right: dataset from the blue, green, and red SPM channels. 

In the text 
Fig. 4
From left to right: empirical distribution of the RMS, amplitude range, and F8 measurements evaluated on a set of oneday VIRGO subseries taken during a solarcycle minimum (2008, colors are related to the three SPM channels). The empirical distributions obtained during a solarcycle maximum (2003) are shown by the black contours. Distributions derived using the HMI data described in Sect. 3 are shown by the yellow histograms.In each panel, the vertical dashed line indicates the transit depth expected for an Earth analogue (84 ppm). 

In the text 
Fig. 5
Effect of data binning on the RMS of oneday solar subseries (1996, black), on the synthetic times series generated with Eq. (3) (yellow), and on synthetic subseries of WGNs (blue). The RMS of the WGN series decreases as (light blue). Solid lines indicate the median values of the observed dispersion (shaded areas). The RMS of the time series at large bin sizes shows a significant dispersion as the number of data points decreases. Horizontal dotted lines indicate the fraction of a typical Earthlike transit depth (i.e., 84 ppm). 

In the text 
Fig. 6
Estimation of the splitting frequencies. Main panel: L = 364 ACFs corresponding to the available solar oneday subseries observed in 1996 with the blue SPM channel (gray lines) and their mean ACF (black). The mean ACF allows the automatic derivation of the upper and lower frequencies surrounding the acoustic mode regime {f_{h}, f_{c} }. Inset panel: logarithm representation of the associated PSD estimates computed using Eq. (1). The PSD allows us to localize the flicker frequency {f_{g} }. The lower frequency {f_{ℓ}} is set by the window length of the SG filter applied to each oneday subseries to remove the longterm variability. In both panels,the splitting frequencies are indicated by the dotted vertical red lines. The blue dotted line represents the slope of the PSD (see Eq. (2)) measured in the frequency region associated with the granulation regime ν ∈ [f_{g}, f_{c}]. From high to low frequency, the distinct regions are denoted by letters A to D. 

In the text 
Fig. 7
Evolution of the inferred power index estimated on averaged periodograms of oneday solar subseries. From left to right: index evaluated for (a) the highfrequency region, (b) the pmodes region, (c) the granulation region, and (d) the lowest frequency region. Each color corresponds to an SPM channel. In region (c), the index obtained on the HMI solar observations discussed in Sect. 3.1 is shown by the horizontal black line (with the 1σ uncertainties in gray). This index results from the averaged periodogram computed using the 91 oneday subseries that have been selected during the period of a solar minimum (2017–2019). 

In the text 
Fig. 8
Top: example of a raw solar light curve extracted from HMI images (black) and the corresponding smooth detrending function based on SG filters with a window size of 5 h (red). An artificial transit of a 5 R_{⊕} planet crossing the center of the solar disk (b = 0) is shown in gray. Bottom: residuals of the raw solar light curve after correcting by the SDO satellite motion. 

In the text 
Fig. 9
Example of artificial exoplanet transit light curves generated using solar HMI observations (quiet Sun, 20181210). Each panel shows five transits for planets with sizes ranging from 1 to 10 R_{⊕} (see legend). Panels from left to right: different orbit configurations (see the panels’ header). 

In the text 
Fig. 10
Top: example of artificial transit of an Earthsized planet crossing the disk center of the Sun (b = 0, black). The transit model with the true input parameters is shown in green and the model computed using the inferred parameters in red. The error on R_{p} ∕R_{s} is around 2% in this example. Bottom: residuals based on the inferred transit model. 

In the text 
Fig. 11
Distribution of the absolute percentage error on the planettostar radius ratio inferred from our simulated transits for planets with sizes R_{p} = [1, 3, 5] R_{⊕} (left to right) and impact parameters b = 0 (top) and b = 0.8 (bottom). Inferred errors on radius ratios derived from light curves containing only WGN are shown in red. 

In the text 
Fig. 12
Planettostar radius ratio and 1σ uncertainties inferred from the MCMC analyses performed on the artificial light curves of exoplanets of size R_{p} = 1 R_{⊕} (left) and R_{p} = 10 R_{⊕} (right). The case of b = 0 is shown in black and b = 0.8 in red. The true radius ratios are indicated by the horizontal dotted lines. 

In the text 
Fig. 13
From top to bottom: distribution of the absolute percentage errors on t_{d}, T_{0}, and b for the whole set of artificial transit light curves. 

In the text 
Fig. 14
Example of an averaged periodogram of the Kepler target KIC 7940546 (M_{s} = 1.152 M_{⊙}, R_{s} = 1.807 R_{⊙}, T_{eff} = 6244 K, m_{v} = 7.397). The number of available oneday subseries for this target is L = 75. Cutoff frequencies f_{c} and f_{g} resulting from the MCMC analysis are indicated by the vertical dashed lines. 

In the text 
Fig. 15
Selected Kepler targets represented as a function of their mass and radius. The color code indicates the inferred index parameters in the frequency region of granulation (α_{g}) derived aftercorrection by the HF noise. The best targets with R^{2} > 0.9 are shown in color (see Sect. 4.3). 

In the text 
Fig. 16
Estimated values of the cutoff flicker frequency f_{g} as a function of stellar parameters resulting from the MCMC analyses performed on periodograms of the selected SC Kepler targets. From left to right: stellar mass, radius, effective temperature, surface gravity, and apparent magnitude. Black dots represent targets with magnitude m_{v} ≤ 10 and gray dots targets with magnitude m_{v} > 10. The star symbol shows the solar cutoff frequency evaluated from VIRGO data (see Sect. 2.4). Pearson (ρ_{P}) and Spearman’s (ρ_{S}) coefficients, evaluated for stars with m_{v} ≤ 10, are indicated in each panel. 

In the text 
Fig. 17
Estimated flicker index associated to granulation (α_{g}) as a function of stellar parameters. From left to right: stellar mass, radius, effective temperature, surface gravity, apparent magnitude, and normalized ν_{max} resulting from (4). Top: values obtained from the MCMC analyses. The color code indicates the apparent magnitude of the target: m_{v} < 10 (black) and m_{v} > 10 (gray). The star symbol represents the index derived for the Sun based on VIRGO green channel observations: α_{g} = 1.26 with σ_{W} = 5 ppm. The square symbol represents the index obtained after adding a HF noise level (corresponding to that seen in Kepler observations of the Sunlike star KIC 3427720) to the VIRGO subseries. Bottom: values obtained after interpolation that corresponds to a HF level of σ_{W} = 30 ppm (see Sect. 4.3). The color code indicates here the data with R^{2} > 0.8 (i.e., the best fits, black) and R^{2} > 0.5 (blue). The triangle symbol represents the value of the flicker index derived by adding a WGN of σ_{W} = 30 ppm in solar VIRGO observations, for which we find: α_{g} = 0.9464. This is the raw level of HF noise we expect for Sunlike stars observed with CHEOPS. Pearson (ρ_{P}) and Spearman’s (ρ_{S}) coefficients associated with these plots for the best targets (R^{2} > 0.9) are indicated on each panel. 

In the text 
Fig. 18
Correlations between ν_{max} (determined using Eq. (4)), the corner frequency, and the flicker frequency. The corner and flicker frequencies have been derived for each Kepler target using the MCMC analysis described in Sect. 4.2. The color code indicates the apparent magnitude of the target: m_{v} < 10 (black) and m_{v} > 10 (gray). Solar values derived from VIRGO observations are shown by the yellow star in each panel. Pearson (ρ_{P}) and Spearman’s (ρ_{S}) coefficients associated with these plots are indicated on each panel. 

In the text 
Fig. 19
Estimated values of the flicker index as a function of the HF noise level added in the VIRGO time series (red, blue and green SPM channels). Symbols show the power index measured on the PSD of Kepler Sunlike stars listed in Table C.1. 

In the text 
Fig. 20
Flicker index α_{g} as a function of the level of HF noise (black) for the Fstar KIC 7940546. The first value, computedfrom raw Kepler observations, is indicated by the red dot and corresponds to α_{g} = 1.23 and σ_{W} = 69 ppm. The gray line shows the interpolated function (see Eq. (5)), for which we obtained a quality factor of R^{2} = 0.96. The interpolated index at σ_{W} = 30 ppm is α_{g} = 1.69 (blue triangle). 

In the text 
Fig. 21
Coefficients {a, b, c} involved in Eq. (5) as a function of the Kepler stellar parameters. Red curves represent the linear and quadratic functions described in Eq. (6). 

In the text 
Fig. 22
Illustration of the limiting stellar apparent magnitude, depending on the stellar parameters, (radius and mass) that is needed to measure a flicker index with α_{g} > 0.2 with the future CHEOPS (left) and PLATO (right) highprecision observations. 

In the text 
Fig. A.1
Distributions of the percentage error on the transit depth (δ) shown for the artificial transits generated using one solar time series (20181210). Each panel represents the errors for a different planet size (R_{p} = 1, 3, 5, 7, 10 R_{⊕} from top to bottom, resp.) and impact parameter (see legend). The temporal evolution of the ratio of covered to uncovered numbers of pixels has been measured in transit (δ_{in}). 

In the text 
Fig. B.1
Top: example of a raw solar dataset (black) and mean of the predictive distribution of the GP model (red). Bottom: residuals of the solar dataset corrected by the GP model. The standard deviation of these residuals is used as input synthetic errorbars in our artificial transit light curves. 

In the text 
Current usage metrics show cumulative count of Article Views (fulltext article views including HTML views, PDF and ePub downloads, according to the available data) and Abstracts Views on Vision4Press platform.
Data correspond to usage on the plateform after 2015. The current usage metrics is available 4896 hours after online publication and is updated daily on week days.
Initial download of the metrics may take a while.