Back to simulating star formation histories

I’ve decided to take a break from real galaxy data and run some simulations of synthetic galaxy spectra. The main motivation was to help make an informed decision about which subset of the ProGeny based SSP libraries to use: the ones based on the MIST isochrones have ages uniformly logarithmically spaced at 0.05 dex intervals and 15 metallicity bins. Most of the latter are considerably sub-solar and aren’t likely to be major contributors to the light of nearby massive galaxies. For possibly metal poor dwarf galaxies it’s always possible to pick a metal poor subset — I have a subset of EMILES that I’ve used for that purpose for some time. More important to me is the choice of age bins. As I mentioned in the last post I did model runs with both 42 (0.1 dex age intervals) and 83 ages, with SSP models younger than log(T) = 6 and older than the currently accepted age of the universe excluded. Since execution time of a Stan model is roughly proportional to the number of parameters and these models are rather time consuming I’d like to know if the coarser age spacing has sufficient resolution. I will show below why it’s correct to exclude the very youngest populations.

Other reasons to do these kinds of exercises are to test the accuracy of retrieval of key quantities in idealized conditions where model assumptions are met by the inputs, to compare different model libraries, or to compare different modeling codes (something I don’t plan to do at present). Performing simulations is de rigueur in the SFH modeling industry and there are many examples in the literature, both as appendices and standalone papers — to name one the second of the two ProGeny papers I linked last time falls in this category. I’ve even done some limited simulations as far back as 2018, but the only case I wrote about was a very simple one.

This time I’ve decided to be more systematic and at least slightly more realistic. Since I remain interested in post-starburst galaxies I am creating simulations with two components: a base population that begins forming stars at the beginning of the universe, which I take to be a lookback time of \(10^{10.125}\) yr. = \(13.36\) Gyr. The second component is a “burst” population that commences star formation at a lookback time of \(\mathsf{t}_b\) with a relative strength of \(\mathsf{f}_b\). The base population has solar metallicity (Z = 0.02) while the burst is slightly subsolar (log(Z/Z) = -0.25). This obviously isn’t meant to represent a realistic model for chemical evolution — the purpose again is to test the ability to recover the input stellar metallicities in idealized circumstances. In keeping with my usual attitude of treating this as an afterthought I haven’t gotten around to exploring this issue yet.

Both the base and burst populations have time varying star formation rates proportional to Gamma probability distributions. The Gamma distribution is a two parameter family of univariate functions that can have a fairly wide variety of shapes, all of which have a single mode at \(t \ge 0\) followed by a monotonic decline asymptotically approaching 0. See the table below for the most salient properties. I don’t know of any real astrophysical justification1Abramson et al. (2016) argued for a log-normal form for the evolution of the cosmic SFR density and suggested that this describes the evolution of individual galaxies as well. More recently Katsianis, Yang, and Zheng (2021) argued for a gamma distribution fit to the evolution of the CSFRD and again claimed with some physical motivation that this form holds for individual galaxies. I was unaware of the latter work when I started this exercise. for this choice but it’s notable that it includes two of the most popular parametric forms for modeling star formation histories: exponentially declining (\(\alpha = 1\)), and what’s generally called the “delayed-\(\tau\)” model (\(\alpha =2\)) in the literature. Note that using a probability distribution has nothing to do with probability per se, but a probability density has two useful properties: it’s non-negative on its support and has a finite integral (equal to one when properly normalized).

PropertyValue
Parametersshape \(\alpha \ge 0\)
scale \(\theta > 0\)
Density function\(f(t) = \frac{1}{\Gamma(\alpha)\theta^\alpha} t^{(\alpha-1)}\mathsf{e}^{-t/\theta}~~t \ge 0\)
Mean\(\alpha\theta\)
Mode\((\alpha -1)\theta~~\alpha\ge1\)
\(0~~~~\alpha \le 1\)
Variance\(\alpha \theta^2\)
Source: Wikipedia

Given shape and scale parameters the mass in each time bin is proportional to the difference in the cumulative gamma distribution at the beginning and end of each interval. The star formation history for each component is just the mass formed in all age bins, with the burst component multiplied by the burst strength. These are scaled to sum to 1, and the spectra are straightforwardly calculated as matrix products of the input SSP model fluxes with the mass histories. The “flux” values are multiplied by 105 for the sole purpose of making them approximately unit scaled. This has the effect of making the total mass born in the simulated “galaxy” equal to 105 M. The fluxes are interpolated to the same logarithmic wavelength grid used by SDSS, convolved with a gaussian to simulate stellar velocity dispersion, then truncated to the wavelength range (3621 Å, 9000 Å). The velocity dispersion is a settable parameter, but so far I have only used 140 km/sec. The base and burst spectra, which were calculated separately for visualization purposes are added, then multiplied by Calzetti attenuation with a user selected optical depth, then finally perturbed with Gaussian random noise. The flux variances are Poisson-like, that is they are proportional to the “counts” in each wavelength bin and scaled to a target average signal to noise. Again this is a settable parameter, but so far I have used a target SNR of 40, which is fairly typical for a fiber spectrum near the center of the IFU of a bright MaNGA galaxy. So far at least I do not add emission lines, but they are allowed in the models.

The subsequent modeling workflow is essentially the same as I use with real data, with some very minor tweaks reflecting the fact that the flux values are arbitrarily scaled. The Stan code is exactly the same as my current working version. Model runs use the same number of warmup and sampling iterations as my real data runs (250 and 750 respectively with 4 independent chains). Post processing is also essentially the same. I extract the posterior predictive fits to the spectra, the model star formation histories, the optical depth of attenuation and the reddening curve slope parameter. I compute the present day stellar mass, the 100 Myr averaged star formation rates, and specific SFR. The stellar mass and SFR are arbitrarily scaled but would scale by the same factor if these were real data from a distant galaxy, so the SSFR would be the same.I also track a few observables: the 4000 Å break index Dn(4000) and HδA.

This time I am exploring a wider range of star formation histories than my previous effort, which only examined a case or two with instantaneous bursts. With seven parameters (plus two more that I’ve held constant so far) it isn’t really practical to do a comprehensive grid of models, so for the most part I have randomly selected parameter values in the ranges shown below. At the time of writing I’ve run a few more than a hundred models. Most of these use the “small” version of the ProGeny C3K base SSP models as the test library, with a few using the input library as the test. I’ve also done a few with the “large” version (see the previous post for details of the ingredients). I plan to do some simulations with the other model libraries I’ve used, especially EMILES, but that’s somewhat of a different project. For now I’m examining the accuracy of outputs for the idealized case where the inputs satisfy the assumptions of the model.

\( \alpha_0 \in [1, 2] \)
\(\theta_0 \in [2, 6] \)
\(\mathsf{t}_b \in [0.5, 2]\)
\(\mathsf{f}_b \in [0.1, 0.75]\)
\( \alpha_b \in [1, 2] \)
\( \theta_b \in [0.01, 0.25] \)
\( \tau_V \in [0.0, 0.5] \)

Here is a cherry-picked example of a fairly strong and recent burst, using the small C3K based library as the test. First is the “observed” spectrum and the replicated values from the posterior (top pane), and the posterior residuals (bottom pane). The residuals look very much like — and pass standard statistical tests for — Gaussian white noise. This is rather different than I usually see with real data, of which a number of examples can be found by scrolling down through previous posts, where there are often large regions of spectra that are systematically misfit. This reflects the fact that all model assumptions are met by the data by construction, so fitting the data is expected.

Simulated spectrum and “posterior predictive” fit and residuals.

The model star formation history from the same simulated data contains some surprises, and these turn out to be fairly generic features of most model runs. This is a bit of a new graphical display for me. I’ve put the star formation rate history above the mass growth history with the same age scale for each so I can get different views of the same sequence of events. The input model parameters are on a line in between. The blue lines are the input star (mass) formation history. The red line in the second plot is from the preliminary maximum likelihood fit. The black lines and grey ribbons are the marginal means and 95% confidence intervals. Reading from right to left there are several points to note:

  1. The details of the pre-burst SFH are largely lost, although the total mass formed in the model is roughly the same.
  2. The ramp up to the peak of the burst begins sooner and is more gradual than the input. Note that both the input SFR and mass growth are well outside the model confidence intervals in the ramp up period.
  3. Most significantly in my mind at least, the peak of the burst is much later (more recent) in the model than the input. This turned out to be a fairly generic feature of these models.
  4. In this run the post-peak decay is rather steeper on average than the input. In model runs with a rapid input decay the model will usually have a more gradual one.
  5. The input and model SFR eventually converge and the late time model SFH agrees very well with the input, except
  6. There’s an odd upturn in SFR at the very youngest ages (< 3 Myr). This happens to varying degrees in every simulation model run and in all the real data I’ve looked at. It turns out there’s a simple explanation for this. There is virtually no spectroscopic evolution in the relevant wavelength range in the first few Myr of a coeval population’s life. Since the spectra are indistinguishable there is no way to constrain the individual contributions without a more informative prior, and the models are correctly estimating the same marginal distributions for the youngest few populations. Since the time intervals get shorter with decreasing age this produces an apparent upturn in SFR.
Black line and grey ribbon: model results for SFH and MGH. Blue line: input. Red line: MGH from maximum likelihood fit.
ProGeny generated SSP model spectra — C3K base models – ages 1-10 Myr, solar metallicity
ProGeny generated SSP model spectra — C3K base models – ages 1-2 Myr, solar metallicity. Spectra are vertically offset.

There are two surprises in the simulations I’ve run so far. First is that, like the example above, the time of peak star formation in the model burst tends to be biased downwards (more recent) from the input. The bias gets larger in magnitude with increasing lookback time to burst onset and can be as large as several hundred Myr as seen below. The few outliers here were weak bursts with very large variations in the time of maximum SFR.

Bias in time of peak SFR vs. input burst onset time

The other surprise is that the 95% confidence bands of the model star formation histories, whether viewed as SFR or mass growth histories, fail to cover the inputs for at least some epochs around the time of the burst. In other words the models are saying that the actual input star formation histories are highly unlikely. Why is this?

It’s not, I think, because of any convergence failure. Stan is quite aggressive at flagging potential issues and there were virtually none in any of the model runs. I also don’t think it’s because the model runs were too short. I’ve occasionally gathered much larger samples with longer warmup times as well and except for exploring the tails of the distributions a little more thoroughly there is little change in marginal 95% confidence intervals.

Most of the model runs were done with the reduced size SSP library with just half of the available age bins (0.1 dex intervals). This doesn’t seem to have reduced the real effective age resolution, but in the small number of runs I did with the full set of ages the marginal contributions have larger confidence intervals, so this could increase the coverage.

I suspect though that the prior on the star formation history is having a larger effect than I would have suspected. The main clues here are the maximum likelihood fits. which often produce mass growth histories (the red lines in the plot above and the ones below the fold below) that are rather close to the input MGH, also falling outside the 95% model confidence bands for at least some epochs. In my current workflow the slightly perturbed maximum likelihood solution is used to initialize the Stan model. In most of the model runs the samples quickly move away from the initial values. While the fits to the data are good in all model runs the summed log-likelihood of the sample spectra is always less than the maximum likelihood solution, in the sample shown below by about 7%.

Log-likelihood of posterior samples and maximum likelihood solution for a simulated model run

Below the fold I’ve posted some more cherry picked examples of model star formation histories. A few general comments about them:

  1. The late time star formation rate is generally well captured by the models.
  2. The ancient (pre burst) star formation history is sometimes captured well with weak bursts. Particulary noteworthy is the second example below, which has a recent and weak burst with a gradual ramp up and decay. The entire star formation history is modeled accurately. In the literature there are sometimes references to “outshining” or a “dazzle” effect in (post)-starburst galaxies. This seems to be a real thing.
  3. Early time bursts aren’t necessarily recognizable as such. In some cases the model just shows an extended period of star formation followed by more or less gradual quenching. I believe this is consistent with some claims in the literature, but I don’t care to look for the relevant papers right now.

I’ll conclude with a look at results for all simulation runs (so far) for some of the summary quantities I track. First, the present day stellar mass is on average unbiased but in individual runs there are errors as large as ±0.1 dex. The typical posterior uncertainties are only ~0.01 dex, so the models are consistently too optimistic. I’ve noted this in real data, which shows similar nominal uncertainties with moderately high S/N data.

Not shown here but I did notice a weak dependence on the input burst onset time or time of maximum SFR in the sense that recent bursts tend to underestimate the stellar mass. This is another possible manifestation of the “outshining” effect.

Error in model stellar mass vs. input stellar mass

The most dramatic bias is a tendency to overestimate the specific star formation rate when the input SSFR is low. This is also something I’ve seen in real data: as I’ve noted a time or two even the reddest and deadest of ellipticals seem to have a floor of about -11.5 for the estimated SSFR.

One thing I didn’t think about until finishing this analysis is that the contribution estimates tend to be skewed to the right when they’re near 0, so the median might be a better estimate of central tendency than the mean. That might make a (probably) small difference. I will look into this.

Error in model specific star formation rate vs. input SSFR

Finally, there’s a nearly linear relation between the error in the posterior estimate of the optical depth of attenuation and the input. Note also the stratification of the reddening slope parameter δ: at relatively high input optical depths the largest departures have the reddest attenuation curves. I think this behavior is directly related to the prior. Since I don’t think the attenuation is very well constrained by the data I use fairly informative priors on both the optical depth and slope parameter; both priors are truncated normals with a lower bound of 0 for τV and a range of (-0.5,1.2) for δ. This is an easy thing to test simply by changing the priors for these parameters.

Mean and standard deviation of model optical depth – input optical depth vs. input optical depth

I am not quite finished with these simulations. I want to expand the range of input parameters a little bit, and I need to look a little more carefully at subsets of the ProGeny SSP models. The main result so far is that my hopes of timing key events in post-starburst galaxies’ histories were probably too optimistic.

Continue reading “Back to simulating star formation histories”

ProGeny: The solution to my SSP model problems?

I’ve written a few times about my dissatisfaction with the EMILES based SSP model library that I’ve used for star formation history modeling for some years and my desultory search for a replacement: This post from early 2024 was the most detailed. Unfortunately neither of the two alternatives I tested were quite satisfactory and my hopes for a MaStar based library from the final MaNGA data release haven’t yet materialized nearly a year later. I’ve given some thought to producing my own but I don’t know exactly how to create sets of summed spectra from a collection of isochrones and given IMF, and it wasn’t really something I wanted to learn. It’s also the case that even though MaStar has many more stellar spectra than any other publicly available collection (MILES, which is the basis for most (semi-)empirical libraries contains just 985) it still doesn’t fully span the space of physical parameters, so additional theoretical or empirical inputs will be required.

Somewhat serendipitously since I was traveling at the time and could easily have missed them a pair of papers appeared on arxiv on 23 October describing an R package named “ProGeny” that does precisely what I need: 2410.17697 and 2410.17698 by A. Robotham and S. Bellstedt. The first paper introduces the package and describes the data currently available for generating what they call Stellar Population Libraries (SPL). While it might seem surprising they used R for this1Python is the language of choice for most astronomers these days. the senior author has published several R packages for various astronomy related tasks, so it isn’t actually surprising that given their apparent need for this tool they chose R to implement it. This was a timely publication for me since I’ve about reached the limit of what I want to do with the EMILES library.

Besides the software which is written entirely in R, the authors did a good job of collating the publicly available data for the required inputs of isochrones and stellar atmospheres and packaging them into FITS files that are readable in R with the help of addon packages. Functions within ProGeny take care of reading the files and creating R objects for further processing.

After installing the software and dependencies and downloading the data files (several GB unzipped) I spent a little time browsing the (rather sparse) documentation and the two vignettes that have been published so far and it was then pretty straightforward to create a library with the recommended inputs. Specifically I used the MIST isochrones combined with the following stellar atmospheres: “C3K” for the base, “PHOENIX” extended, “AGB” from Lancon, “TMAP” for hot stars including remnants: see Table 3 of the first paper for descriptions and references. I used a Kroupa IMF with lower and upper mass limits of 0.1 and 100 M, which are fairly standard choices and the same as my EMILES based library. The constructed library has 107 age bins in logarithmic intervals of 0.05 dex ranging from log(T) = 5.0 to 10.3, 15 metallicity bins with log(Z/Z) ranging from -4 to +.5 with adopted Z = 0.02, and wavelengths from 5.6 Å to 36 mm. Almost all of the data that I use as model inputs is calculated, including of course the spectra and remaining stellar mass at each age and metallicity. The only thing missing is relative g, r, i band fluxes that I use to calculate model intrinsic colors (something I occasionally look at as a sort of proxy for specific star formation rate). This I have existing code for.

These are more age/metallicity combinations than I need or probably can use and definitely a much larger wavelength range than needed for MaNGA data, so I’m experimenting with subsets. For a first pass I created a “small” subset with 42 ages ranging from log(T) = 6 to 10.1 in 0.1 dex increments and 5 metallicities log(Z/Z) = {-0.75, -0.25, 0, 0.25, 0.5}. The metallicity range is similar to my EMILES based library but with one more bin, while there are fewer but more evenly spaced age bins. I’ve also created a “large” subset with all 83 age bins between log(T) = 6 and 10.1, and the same 5 metallicities. I didn’t include younger populations for the simple reason that there’s not much evolution in the first Myr, which makes it difficult or impossible to disentangle their contributions. Similar considerations apply at the old end, and in addition there’s the fact that the current consensus age of the universe is 13.8 Gyr (log(T) = 10.14).

I’ve tentatively chosen a wavelength range of 3300-9000Å. The blue end was chosen to cover the shortest rest frame wavelength in MaNGA spectra for redshifts out to z ≈ 0.1. The red limit is more of a judgment call. Night sky lines become problematic at wavelengths as short as ≈ 8000Å, and in addition there’s an artifact called “blowtorch” by the SDSS team that causes an upturn at the very red end in some spectra. The chosen cutoff is just to the red of a set of Calcium triplet absorption lines at 8500-8700Å that are prominent in some spectra.

Below are comparisons of a few model spectra at solar metallicity and decade age intervals. Recall that the 106 and 107 year models are from the Hr-PyPopStar models of Millan-Irigoyen et al. (2021) including continuum emission. Evidently the newly constructed models are rather more luminous and bluer than the PyPopstar models, while closer to the EMILES models especially at ages > 1 Gyr. Possible differences in absorption line strengths could be due to finer wavelength sampling in the ProGeny models. One possibly significant difference is the MIST isochrones track more stages of stellar evolution than the version of BaSTI used for EMILES, including the asymptotic giant branch and stellar remnants. How much these contribute to the light in the relevant wavelength range and at what ages has yet to be investigated by me.

Sample SSP model spectra – ProGeny generated from c3k base models and EMILES + Pypopstar

Modern collections of stellar isochrones generally include a prescription for mass loss and remnant masses in evolving populations. ProGeny calculates these and includes a data table of stellar masses in its SSP model output. The graph below compares the remaining stellar mass (including remnants) with age for the provisional library versus EMILES. I had assigned a relative stellar mass of 1 to the PyPopstar models, while the MIST isochrones indicate about a 10% mass loss by 107 years. At older ages the masses are very similar. This is much closer agreement than the equivalent graph in Figure 12 of paper 1, so the adopted relation shouldn’t be a source of significant differences in present day stellar mass computations.

Stellar mass vs. age – MIST vs. BaSTI isochrones at several metallicities

So far I’ve only run models for two galaxies with rather different properties, the same ones I wrote about in this post: the post-starburst plateifu 10220-3703 (mangaid 1-201936; NED name WISEA J080218.38+323207.8) and the late type spiral plateifu 11018-12704 (mangaid 1-233951; UGC 8162). Since it has taken much longer to prepare this post than I anticipated, I’m only going to discuss the post-starburst for now. The spiral will have to wait for the new year.

I did an initial set of runs with the same model formulation that I’ve used for a few years now and got some rather surprising results, particularly for outlying regions of the galaxy: model star formation histories looked more like an actively star forming galaxy than a passively evolving one, with younger populations and higher recent star formation rates. The models also had relatively high attenuation values with much steeper attenuation curves than are reported in the literature. This is I think a known “degeneracy” between attenuation and stellar age. The tentative solution was simply to tighten the priors on the two attenuation parameters: I set both upper and lower bounds on the steepness parameter δ: -0.5 < δ < 1.2 and also decreased the scale parameters on the priors for both τV and δ (see this post for my initial discussion of the attenuation model and the following one for problems I initially encountered; the parameterization was based on the work of Salim et al. 2018). Plugging into equation 4 of Salim et al. the limits on δ correspond to 1.6 < RV < 8.7, which is still a very large range.

As can be seen from the SFH and mass growth plots below the C3K based models are fairly consistent with EMILES, although with generally higher late time star formation rates. Interestingly, the “big” models also have generally higher late time SFR than the “small” models even though they have the same stellar ingredients with just different time resolution. Both sets of models have gotten rid of the objectionable artificial jumps in star formation rates where the BaSTI models have abrupt changes in age intervals.

One fairly significant difference is that in the EMILES based models the burst appears to be strongly centrally concentrated, fading beyond about 2 kpc from the center. The C3K based models by contrast show a burst throughout. In the center the EMILES models have a complex, multipeaked evolution through the burst, while the C3K models have a fairly smooth rise and decline. Whatever hopes I had that it might be possible to time major events in possible merger scenarios seem to have been misguided., although the age and magnitude of the burst agree in all model runs.

Plateifu 10220-3703 (mangaid 1-201936) Model star formation histories for all binned spectra and 3 distinct SSP model libraries
Plateifu 10220-3703 (mangaid 1-201936) Model mass growth histories for all binned spectra and 3 distinct SSP model libraries

Here are pairwise differences in mean values between the models for some of the summary quantities I track. Some of the differences are fairly large, which just reinforces my longstanding opinion that true uncertainties are much larger than what’s captured in the posteriors of model runs.

And finally, here is a closer look at star formation history models for the two fibers closest to the center:

Plateifu 10220-3703 (mangaid 1-201936) SFH and mass growth history estimates for innermost 2 spectra and 3 SSP libraries

And here are (posterior predictive) fits to the data for the central spectrum with the large C3K based model (L) and EMILES (R). Both of these have some problematic areas, with some stretches of continuum and absorption line strengths “off” a bit. Note that the C3K model does better with the trough around 7200Å. The continuum upturn at the red end can also be seen in C3K model, which suggests I may want to consider truncating the model spectra a few hundred Ångstroms to the blue. Overall though fits to the data are very similar, and there’s little difference between the “large” and “small” C3K subsets.

Plateifu 10220-3703 (mangaid 1-201936) fits to the central spectrum with C3K and EMILES libraries

ProGeny is a useful tool and I hope the authors will be able to maintain it and add new capabilities and data sources. I still have a few things to do before resuming a modeling campaign: most importantly perhaps choosing a final subset of the full library, specifically the number of age and metallicity bins. The execution time of a model run is roughly proportional to the number of stellar contribution parameters. For this data set the average time with the EMILES library (224 SSP model spectra) was 618 sec, 576 sec for the small C3K library (210 spectra), and 1038 sec. for the large C3K library (415 spectra). That’s more time than I’d like to take. Would it be preferable to have a coarser metallicity grid, say log(Z/Z) = {-0.75, 0, 0.5} with the full 0.05 dex age spacing or a finer metallicity grid with 0.1 dex age bins? I don’t know: some fake data exercises might be appropriate at this point.

I still have some interest in using the MaStar spectra but there are some technical issues to solve. For one thing the data available on Skyserver are corrected to above the atmosphere but not for galactic extinction and are just apparent flux values rather than absolute ones. They also need to be reduced to rest frame wavelengths. ProGeny’s model spectra have a very wide wavelength range, which I assume is required to make the integrated model flux equal to the bolometric luminosity. Even though I don’t really need wavelengths outside the range of BOSS spectra I would need to extend them somehow anyway. Finally (?) the parameter coverage (temperature, gravity, metallicity) is still not quite complete, especially at for hot stars and very luminous ones so it would still be necessary to supplement the base library.

Quantifying burstiness, and another brief look at SDSS J095343.89-000524.7

One simple way to quantify the burstiness of star formation is just to estimate the average star formation rate over large time intervals divided by the average SFR over cosmic time. Of particular interest is the time interval between ~100 Myr and ~1 Gyr since this is roughly the time interval that a post-starburst galaxy is recognizable as such.

Partly because it happens to still be in my active workspace and partly because it’s really interesting I’m going to take another look at SDSS J095343.89-000524.7 (MaNGA mangaid 1-897).  This was in the post-starburst ancillary sample, selected from the catalog by Pattarakijwanich et al.

This image from the Subaru HSC-SSP survey1retrieved as a screenshot from the Legacy Survey sky browser. is much deeper than SDSS imaging and clearly shows extended tidal tails and debris, suggesting that these galaxies have been interacting for some time.

SDSS J095343.89-000524.7 (observed as mangaid 1-897). Image screenshot from Subaru HSC survey.

Moving on to various properties derived from the MaNGA spectroscopy and my SFH models with, still, EMILES based SSP models. First here are maps of stellar mass density and 100 Myr averaged star formation rate density. Note that I rebinned the spectra from two posts ago to try to capture more of the tidal tails while excluding the truly blank regions of sky. There are two clear peaks in the stellar mass density separated by a projected distance of about 11 kpc. The central stellar mass densities are nearly the same at about 108.95 M☉/kpc2 . Interestingly enough the bright white peak in surface brightness appears not to coincide with the western peak in stellar mass density, but is offset by a small amount to the north.

Note also that the highest recent star formation is offset to the north of the apparent western nucleus. I’ll look at that in more detail below.

MaNGA plateifu 10843-9101 (mangaid 1-897). Maps of stellar mass density and star formation rate density.

The ionized gas properties are rather different in the two galaxies. Below are BPT classifications using, as usual for me, just the [O III]/Hβ vs. [N II]/Hα diagnostics and Kauffmann’s classification scheme. Emission line fluxes are generally stronger in the eastern galaxy with mostly star forming line ratios. Note two spectra with “composite” line ratios are near the eastern nucleus and might therefore actually be due to a mix of stellar and AGN ionization.

MaNGA plateifu 10843-9101 (mangaid 1-897). BPT classifications from [O III]/Hβ vs. [N II]/Hα diagnostics

I calculate a few “strong line” gas metallicity estimates from standard literature sources. The one that seems to produce the most consistent estimates is the calibration of Dopita et al. (2016) based on the ratios of [N II 6548]/[S II 6717, 6731] and [N II]/Hα. The eastern galaxy shows a fairly smooth radial gradient while the west is considerably metal enriched in the region with the strongest starburst. The highest metallicity is right at the center of the IFU at the position of the bright white source.

MaNGA mangaid 1-897 (plateifu 10843-9101). Gas phase metallicity 12 + log(O/H) from strong line calibration of Dopita et al. (2016).

Let’s return to the idea I had at the top of the post to look at star formation rates in broad time intervals relative to the mean star formation rate over cosmic time. For this exploratory exercise I used just 4 bins with upper age limits of 0.1, 1.25, 2.25, and (nominally) 14 Gyr. There seems no point being too fastidious about calculating the bin widths: I just used the difference in nominal ages between the endpoints. I did take into account the lookback time to the galaxy, which for this one is about 1 Gyr (z = 0.083), so the final bin has a calculated width of 10.5Gyr. I chose to make the 3rd, intermediate age bin a rather short 1 Gyr wide to look for aging starbursts that might be missed using the typical selection criterion of strong Balmer absorption. In this case there’s no evidence of that: both galaxies seem to have had uneventful histories up until ~1 Gyr ago.

The top row of the plot below is the most interesting: there appear to have been two major bursts of recent star formation, both highly localized to the central region of the western galaxy. If the model estimate of the location of the peak stellar mass density is correct the fiber with the largest star formation excess in the 100 Myr – 1.25Gyr interval is offset just to the north and coincident with the IFU center. The more recent burst is also offset from the older one. There is a hint of recent accelerated star formation over most of both galaxies.

MaNGA plateifu 10843-9101 (mangaid 1-897). Maps of relative average SFR over the designated time intervals.

For the rest of this post I plot model fits to the spectra and star formation histories for the fibers surrounding the two nuclei. These are ordered approximately from north to south and west to east. For reference the IFU center is at (ra, dec) = (148.43291, -0.09018). The model has the peak stellar mass density in the western system at (ra, dec) = (148.4328, -0.09062). The eastern galaxy’s nucleus is at (ra, dec) = (148.4349, -0.09064).

Note below that the plots have different vertical scales. The horizontal scales are the same for both spectra and star formation histories, but at least one SFH plot is slightly misaligned.

Central region – western galaxy

Central region – eastern galaxy

In an earller post I mentioned a MaNGA related paper by Cheng et al. who found nearly 500 systems with post-starburst characteristics that fell in 3 broad categories: centrally concentrated PSB regions, ring-like, and irregularly located. Clearly any galaxy that was selected based on SDSS spectroscopy that’s not a false positive will have a central PSB region, although that of course doesn’t preclude extended post-starburst conditions. This particular galaxy appears to have a remarkably compact post-starburst region.

When time permits again I plan to look at the remaining 40 galaxies in this sample. Unfortunately the larger sample of Cheng et al. appears to have no published catalog.

More SDSS selected post-starburst galaxies in MaNGA

I haven’t given up on this topic. Just a longer than expected break.

I found two other catalogs of candidate PSBs selected from SDSS spectroscopy. First are the “SPOGs” (Shocked POst-starburst Galaxies” of Alatalo et al. (2016), with the catalog retrieved from VizieR at J/ApJS/224/38/table2. These were selected to have strong emission lines with ratios consistent with shocks as the ionizing mechanism, while also having strong Balmer absorption indicating the presence of a large intermediate age stellar light contribution.

The second was the sample of Pattarakijwanich et al. (2016) retrieved from J/ApJ/833/19/table3. This work used more traditional post-starburst selection criteria although somewhat more relaxed than for example Goto. Together these added 19 galaxies to the sample — 14 SPOGs and 5 from Patta… Together these, along with the Melnick and dePropris sample added about 1000 binned spectra to the sample.

I’m not going to say much about them for now. SDSS thumbnails are below. One thing I note is that a fairly large fraction of these appear to be normal star forming disk galaxies. Most of those, I suspect, are SPOGs. Of course since they were selected from 3″ SDSS spectra it’s entirely possible these galaxies are centrally quenched due to some feedback mechanism.

I’m still thinking about how to quantify “post-starburstiness.” Perhaps something like the stellar mass formed in a time interval like 1.1 – 0.1 Gyr.

The MaNGA post-starburst ancillary sample

I’ve decided to resume SFH modeling despite still not having a fully satisfactory SSP model library. I’m still using the EMILES + PyPopstar hybrid that has served as the stellar input for several years now. The only change I’ve made — strictly for visualization purposes — is to define the stellar ages as representing the middle of each time interval instead of the end. In star formation rate plots this has the effect of smoothing out the SFR a little bit where there are abrupt jumps in time intervals. This has no effect on the modeling process at all.

One of the MaNGA ancillary programs (PI C. Tremonti) observed a sample of 24 candidate post starburst galaxies drawn from 5 different sources (both published and unpublished) with a variety of selection criteria. In addition to these there are 7 PSB’s from the compilation of Melnick and De Propris (2013) in the primary or secondary samples that I added to the sample for a total of 31. I was able to run successful models for 30 data sets, with one having severe calibration issues that I dropped from further analysis. Altogether there were 1,399 binned spectra in the sample with as usual a large range of bins per galaxy: in this case ranging from 6 to 240.

From a diverse set of selection criteria it’s not too surprising that the sample is rather diverse too, with perhaps a few false positives. I’m not sure it makes sense to treat this as a single homogeneous sample, but for now let’s take a look at a few features of the entire data set. I’ll also take a sneak peek at a particularly interesting pair of galaxies.

First, here is a popular absorption line diagnostic, the Lick HδA – Dn(4000 Å) plane. Points are colored by BPT diagnostic determined from the [N II]/Hα and [O III]/Hβ ratios by the usual criteria. The contours are from measurements of a large sample of SDSS galaxies by the MPA-JHU pipeline, which was run on spectra through DR8.

Lick HδA versus 4000Å break strength – MaNGA post starburst sample. Contour lines are for a large SDSS sample with measurements from the MPA-JHU pipeline.

It seems odd that the bulk of the measurements in this sample are displaced from the bulk of the SDSS sample. I wouldn’t completely rule out errors in my measurements but I tested mine against the MPA-JHU measurements a long time ago, and this particular part of the code is unchanged for some time. Anyway, we see a large range of values of these diagnostics, but with relatively few in the passively evolving region at lower right and many in the “green valley.” Almost 1/3 have strong Balmer absorption with HδA > 5Å EW. Many of these also have star forming BPT diagnostics, so it’s not clear that these regions are post starbursts.

Next, here are (100 Myr averaged) star formation rates plotted against stellar mass density, again color coded by BPT diagnostic. The straight line is my calibration of the center of the “star forming main sequence” from some time ago.

Modeled SFR density vs. stellar mass density – MaNGA post starburst sample.

Evidently there are many regions — mostly with star forming emission line ratios — lying near the star forming main sequence, and also a large number in the green valley. Most of those have weak emission lines, AGN, or LINER-like ratios.

Finally, here is a plot of model specific star formation rate against Dn(4000Å). As I’ve written before a number of authors have noted the relation between the 4000Å break strength and stellar age or specific star formation rate and several have used it as a (usually secondary) star formation rate indicator. The straight line is my estimate of the mean relation for spiral galaxies, originally given in this post.

Model specific star formation rate versus 4000Å break strength – MaNGA post starburst sample.

Evidently by these diagnostics this sample has properties that at least overlap with a random selection of normal galaxies. The only thing notably missing are “red and dead” ETGs. However there are good reasons to think that starbursts – and therefore post starbursts – are generally localized regions within galaxies. We need to look at the spatially resolved properties — specifically star formation histories — to see how many genuine post starburst galaxies are in the sample.

I’m going to end for now with one of the more interesting examples in this sample. The western member of this interacting pair has a remarkably bright and white nucleus, which in SDSS imaging indicates a fairly young stellar population.

SDSS J095343.89-000524.7 MaNGA plateifu 10843-9101 (mangaid 1-897)

I slightly altered my usual workflow for this and a few other data sets in this sample. Usually I try to use all spectra and bin to a minimum target SNR (usually 5) for all bins, but since this IFU had a large fraction of blank sky I set the SNR threshold for accepting a bin lower than I otherwise would and left the lowest SNR spectra out of the analysis. Below is a map of the modeled stellar mass density showing the coverage of the analyzed area.

MaNGA plateifu 10843-9101 (mangaid 1-897). Model stellar mass density; analysis coverage

And for now I’ll just show the spectra of the two nuclear regions with posterior predictive fits of the SFH models, along with model star formation histories. The western nucleus has a remarkable K+A like spectrum but with fairly strong emission from a possible AGN. The model star formation history is one of the most unusual I’ve seen. Whether it’s an accurate record of events is of course uncertain.

MaNGA plateifu 10843-9101 (mangaid 1-897). Nuclear spectra with posterior predictive fits and model star formation histories.

I’m going to continue this topic in additional post(s), and perhaps look for a larger sample. A recent paper by Cheng et al. (2024) found nearly 500 galaxies with post starburst properties in MaNGA, but there seems to be no catalog. I’m not sure their selection criteria are easily reproduced.

A little more on the BPASS SSP models

Three posts ago I did a brief comparison of my usual EMILES based models with the most recent version of Stanway and Eldridge’s BPASS models, which are the first purely theoretical model spectra that seem possibly suitable for full spectrum fitting. I mentioned then that I used a set of models with an age zero upper mass limit of 300 M, while most model libraries adopt an upper mass limit of 100 M. As is customary in this industry their website contains a number of additional model libraries, including with upper stellar mass limits of 100 M and some with single star evolution only. These alternate sets of models have the same structure as the baseline models, so I just used the same scripts to create R readable data sets with the same ages and subset of metallicities..

Not surprisingly the 1 Myr models with 300 M upper limit are considerably more luminous than either the binary or single star models with 100 M limits, but the latter are still somewhat more luminous than my standard library at the same age. Stars >100 M evolve very rapidly and the difference in model spectra disappears by log(T) = 6.6 (4 Myr).

BPASS solar metallicity age 0 model spectra

To test the differences in model star formation histories I just ran one set of models on the post-starburst galaxy WISEA J080218.38+323207.8 (MaNGA plateifu 10220-3704, mangaid 1-201936). And here’s the comparison:

Model star formation histories for 3 SSP model libraries. MaNGA plateifu 10220-3703 (mangaid 1-201936)

The basic result is there’s no real difference either with the 300 M upper limit models or between the binary and single star evolution models1note that one spectrum in the fifth row above has a rather different model SFH for the binary library. This turned out to be from a convergence failure of the sampler in one chain. Again there’s a sharp downturn in star formation rate at the youngest ages and again there’s that peculiar spike at 1.6 Gyr in all model runs. That feature has serious consequences for the interpretation of the model star formation histories in these post-starburst galaxies. In this case the EMILES based models indicate a strongly centrally concentrated burst that began ~1 Gyr ago and lasted several hundred Myr in the center, while fading away to no significant enhancement outside a few kpc from the center. The BPASS models on the other hand have two distinct bursts near the center that straddle those of EMILES, with a significant amount of mass in a short burst throughout the galaxy. While not necessarily implausible the persistence of the 1.6Gyr spike (and as noted before not just in this galaxy) makes me suspect an artifact of some sort.

As a little bit of an aside this galaxy has one published estimate of a detailed star formation history by French et al. (2018) based on GALEX and SDSS photometry and SDSS spectra (not MaNGA). Their best fit model has two bursts at ~500 Myr and 1.5 Gyr with a total mass contribution of ~20 – 65%. Since the SDSS spectra were 3″ diameter this would be for the central region only. This at least broadly agrees with either the EMILES or BPASS based models. I have roughly 75% of the mass in the burst (EMILES) in the central fiber with somewhat more in the BPASS models, but that drops rapidly.

Well, the quest for an updated SSP library continues. Unfortunately the two likely sources of MaStar based models have yet to publish updates. I’m still considering doing my own. Unfortunately I’m not aware of any open (or for that matter closed) source software for generating SSP model spectra. This seems to be something of a dark art.

A little more on star formation history priors

After my not so insightful realization recounted last time that my attempt to modify the prior on star formation histories wasn’t actually doing anything I thought a little further about how to specify one. Gaussians are always popular choices for priors, so why not give them a try? For a first cut I added the following lines to the “transformed data” section of the Stan model:

  vector[nt*nz] norm_sfr;
  
  norm_sfr = (dT .* norm_st)/sum(dT .* norm_st) ;

Then I added a scale parameter sd_sf to the parameters section, and finally in the model section:

    sd_sf ~ normal(0., 1.);
    b_st_s ~ normal(norm_sfr, sd_sf);

Even though the prior allows values of the stellar contributions that are infeasible given the simplex declaration this causes no technical problems: the models sample without complaint and parameters satisfy all constraints. Execution times are comparable to the original model formulation and convergence diagnostics are OK. But, the model runs had some unexpected features. I did a set of model runs for a single MaNGA galaxy from the post-starburst ancillary sample — mangaid 1-201936 (plateifu 10220-3703). Model star formation histories compared to the original model are shown below for the 58 binned spectra:

MaNGA plateifu 10220-3703 (mangaid 1-201936). Model star formation histories with two different SFH priors.

What’s striking here is that several of the spectra in the low S/N outskirts of the galaxy have nearly constant star formation rates with very little sample variation. In other words the models are basically returning the priors. The cause of this behavior isn’t quite clear. Relatively low signal to noise seems to be necessary, but not sufficient since similarly noisy spectra have essentially the same SFH’s as the original model formulation. It also isn’t due to convergence failure because much longer runs with more adaptation iterations show the same behavior. It is possible perhaps that the posteriors are significantly multimodal and Stan is preferentially falling into one of them. Notable also is that the fits to the data measured by log-likelihood are virtually identical even for the runs with the anomalous SFH’s. At the very least this tells us that uncertainties in quantities derived from the models are considerably larger than within model run variations — of course I have always believed this and said so a number of times.

After trying several variations on this theme that either had none at all or undesirable effects on sampling, and after some additional consideration I think that, given the model parametrization, the uniform on the simplex prior for stellar contributions is actually the one I want. That leaves the question of what, if anything, to do about the abrupt jumps in model star formation rates.

One possibility is simply to redefine the endpoints of the age bins to be, say, halfway between nominal SSP ages instead of at the model ages as is my current practice. In the case of the EMILES library this would mean for example that the 3.75, 4, 4.5 Gyr bins would have widths of 0.25, 0.375, 0.5 Gyr instead of the present 0.25, 0.25, 0.5. This involves no change to the actual model runs at all, so most quantities derived from the models are unchanged.

Another solution is to adopt a library with a more uniform age progression. One with approximately equal increments in log age seems preferred. As yet there have been no published updates to the MaStar based SSP libraries mentioned last time, so I’m waiting for them, while still considering generating my own.

I’m going to briefly return to BPASS based models. After that I’m not sure.

Priors on stellar contributions

One thing I’ve wondered about for a while is the extent that priors on SSP model contributions affect modeled star formation histories. Previous experience suggests not much at all unless the prior is highly constraining. To be a little more specific in my current workflow I normalize the SSP model spectra to have average flux values of 1 in (approximately) the V band, and also adjust the galaxy fluxes in the same way. In the Stan model the stellar contribution parameters are declared as a simplex, that is a vector with non-zero elements summing to 1. That makes the parameter values the fractional contributions to the (unreddened) galaxy flux at V. My current working code doesn’t provide an explicit prior for the stellar parameters, but they have an implicit proper prior of a uniform distribution on the appropriate dimensional simplex. More technically the prior is a Dirichlet distribution with all concentration parameters equal to 1. Note the marginal distributions aren’t uniform, but they are all the same. One implication of that is that a typical draw from the prior1Stan doesn’t sample from the prior even for initialization, but of course the prior influences the posterior through Bayes’ rule. will have jumps in the star formation rate at exactly the times where the width of the age bins jump, a problem that I’ve noted several times before.

A possible solution to this problem is simply to alter the prior to encourage smoothly varying star formation rate rather than smoothly varying light contributions. It turns out I’ve been feeding my Stan code the data I need to do that: the initial mass in a given model SSP is inversely proportional to the normalization factor applied to the spectrum, and the star formation rate is just the mass divided by the time interval assigned to the SSP. Both of those quantities are passed as data to the Stan model even though they weren’t used in any way previously. I added just 3 lines to the code to change the prior on the stellar contributions. In the “transfored data” section

norm_sfr = 1.0 ./ (dT .* norm_st);
norm_sfr = norm_sfr / mean(norm_sfr);

defines the reweighting of the prior, which is written in the model section as

    target += dirichlet_lupdf((b_st_s .* norm_sfr)/sum(b_st_s .* norm_sfr)|rep_vector(1., nt*nz)); 

If I did this right the prior is completely agnostic to any star formation history, whereas the previous implicit prior was completely agnostic to any run of light contributions.

The modified code compiled without complaint and there’s no discernible difference in either execution time or convergence diagnostics.

I’ve only run this on one set of galaxy data, from MaNGA plateifu 8565-3703 (mangaid 1-92735). This is one of Schawinski’s “blue early type galaxies,” chosen mostly because the data were binned to just 34 spectra so a complete set of model runs only took a few hours. And, as seen below the other things that don’t change are the modeled star formation histories.

I need to do some more validation exercises, but it appears my long ago conclusion that the choice of prior on the star formation history has little effect was correct. The data dominates the model outcome through the likelihood.

Update

Since I hit publish a few days ago I realized two things. First, the weights I applied to the stellar contributions to “encourage” a smoothly varying star formation rate were inverted. What I should have used was:

transformed data {
  vector[nt*nz] norm_sfr = (dT .* norm_st) / mean(dT .* norm_st);
}

The second thing I realized is this makes no difference at all given the form of the prior. The transformation simply maps a point on the simplex to another point on the simplex that has exactly the same probability density since by construction the prior is uniform on the simplex.

So, it should have been expected, and it’s a good thing that it in fact happened that the model runs produced the same results. What happens if I use the following prior, with the weights supplying the parameters for the Dirichlet?

target += dirichlet_lupdf(b_st_s | norm_sfr);

This failed to sample almost always. I’m not entirely sure why, but I suspect this turns a problem with relatively simple geometry at least with regards to the prior to one with a complex and troublesome geometry.

This little experiment actually told me nothing about the effect of priors on model star formation histories. Two of the priors are actually the same, and the third fails for reasons that aren’t completely clear. I may experiment with different forms of prior. I’m still, of course, looking for a new SSP model library.

Considering alternative SSP model libraries

For a few years now I’ve been using a library of SSP1Simple Stellar Population model spectra based on the EMILES models of Vazdekis et al. supplemented with some young stellar populations taken from the “HR-pypopstar” models described in Irigoyen et al. (2021) and retrieved from fractal-es.com/PopStar. As I’ve mentioned several times there are a few issues with the models. First, the abrupt changes in time intervals of the BaSTI isochrone based models produce sharp jumps in model star formation rate estimates at several lookback times. This is because my star formation history models “want” to produce smoothly varying contribution estimates, which lead to abrupt jumps in rates when the time interval decreases.

Another possible issue is there are multiple sources of data used for the SSP model spectra. Besides the pypostar models for young populations the older population spectra use MILES stellar spectra for wavelengths up to 7410Å, with spectra from another source grafted on for longer wavelengths within the relevant range for SDSS. This wavelength happens to fall on a wide absorption feature (I believe mostly due to TiO) that’s prominent in older populations, and this region is often fit rather poorly by the models. Although there are other possible reasons for this I’ve long suspected that it might be due to different flux calibrations of the two data sources. Ideally I’d like to have single, homogeneous data source covering the full wavelength range of SDSS spectra.

That ideal data source may exist in the SDSS MaStar stellar spectrum library, which is a large collection of stellar spectra observed as part of the MaNGA project. Since the observations used the same BOSS spectroscopes they have the same wavelength coverage, spectral resolution, and flux calibration as the MaNGA galaxy spectra. So far I’ve found 2 MaStar based SSP model libraries. First is one described originally in Maraston et al. (2020) with data available at https://www.icg.port.ac.uk/mastar/. This was based on the preliminary (DR15) data release and has a minimum age of 300Myr, which is unsuitable for my purposes. An updated version with sufficiently young models is described in Nanni et al. (2022, 2023, 2024). That version has not yet been published as of the date of this post.

A more immediately usable set of models were described in Sanchez et al. (2022) and retrieved from http://ifs.astroscu.unam.mx/pyPipe3D/templates/. These are based on the 2019 update of the venerable Bruzual and Charlot (2003; BC03) models, with MaStar spectra substituting for MILES+IndoUS in the relevant wavelength range. The spectra were taken from the initial (DR15) data release and an update is promised to be forthcoming.

The linked directory contains a large number of FITS files containing subsets of the full data set(s) — both the CB19 and MaStar based models are included. Since I wanted to select my own subset I downloaded the presumed full data sets in files named “MaStar_CB19.all_1_5.fits.gz” and “MaStar_CB19.all_1_5.fits.gz.” These contain model spectra and some other data for 210 irregularly spaced ages (not 220 as stated in the paper) and 16 metallicities. Very young ages are over-represented — there are 23 under 1 Myr and another 55 between 1 and 10 Myr. Beyond 3 Gyr the interval between models is fixed at 0.25 Gyr. For a preliminary evaluation I chose 47 ages from 1 Myr to 14 Gyr and 5 metallicities ranging from somewhat subsolar (Z = 0.004) to the highest available (Z = 0.06). I used the model spectra as given with wavelengths from 3625-9998.5Å at 1.5Å spacing. I built libraries for both the CB19 and MaStar based spectra, but will only discuss the latter for now.

I also decided to reevaluate the most recent release of the “BPASS” evolution models of Stanway and Eldridge (2018) with data retrieved from links at https://bpass.auckland.ac.nz/9.html. These are purely theoretical models including predictions for spectra, and are unique in attempting to account for binary star evolution. I had looked at a much earlier version and decided that the model continua were much too blue to be useful for full spectrum fitting. This appears to be no longer much of an issue.

Again, there are a large number of files in the data directory with different choices of IMF and upper mass limits. For a first look I chose the file bpass_v2.2.1_imf135_300.tar.gz, which corresponds to a Kroupa IMF with upper mass limit of 300 M 2which may have been a mistake since the upper mass limit in all the other libraries I’ve tried is 100M. Models are available for 11 metallicities, and again I chose 5 with values ranging from Z=0.004 to Z=0.04 (the highest available). The available ages are in equal logarithmic intervals from 106 all the way up to 1011 years with a spacing of 0.1 dex. I just chose all available ages for log(T) = 6 to 10.1, for a total of 42. According to the documentation ages are meant to represent the middle of each time interval. So far I’ve adopted the convention that ages represent the beginning of each time interval of width equal to the time to the next younger age, so for consistency I add 0.05 dex to each model age. I extracted the model spectra in the wavelength range 3500-9499Å, which are tabulated at 1Å intervals.

For a preliminary evaluation I calculated models for just two MaNGA galaxies. Plateifu 10220-3703 (mangaid 1-201936; NED name WISEA J080218.38+323207.8) is a post-starburst taken from the compilation of Melnick and dePropris — one of 8 in the final MaNGA release. The other is a late type spiral, plateifu 11018-12704 (mangaid 1-233951; UGC 8162). This was a more or less random choice drawn from a sample that was intended to be a high purity selection of MaNGA face on spirals based on Galaxy Zoo classifications and NSA axis ratios. These two span the range of galaxy types that I’m likely to want to examine in the near future.

2MASX J08021836+3232078 – MaNGA plateifu 10220-3703 (mangaid 1-201936)
UGC 8162 – MaNGA plateifu 11018-12704 (mangaid 1-233951)

I’m just going to look at a few results of model runs. Execution times, convergence diagnostics, and fits to the data as measured by summed log-likelihoods were all similar so there’s no strong reason to favor one library based on those criteria.

Turning first to the post-starburst galaxy 2MASX J08021836+3232078, here are model star formation rate histories for all binned spectra and all three tested SSP libraries. The ribbons denoting the marginal 95% credible intervals of SFR mostly overlap3note these are log-log plots that span 4 orders of magnitude in star formation rate, which is encouraging, but the model mass growth histories are rather different as seen below4as something of an aside several panels show a strange zigzag pattern. This is a rendering issue with the graphics software rather than a sampling problem. Note in particular that the mass growth histories by construction are monotonic.

Both the EMILES and BPASS models have a strong, centrally concentrated quenched starburst with two peaks, but with slightly different timing. The MaStar based models on the other hand show a long period of quiet evolution with a later and weaker starburst in the central region. A very peculiar feature of the BPASS models is a spike in SFR in almost every model run at logT = 9.2 (1.6 Gyr). A little more on this below.

There are some fairly large differences in summary quantities I track. The average stellar mass density is 0.14 ± 0.05 dex larger in the MaStar based models than EMILES, while it’s 0.12 ± 0.05 dex smaller in the BPASS models. The (100 Myr average) star formation rate density is 0.3 ± 0.1 dex larger in the MaStar models and 0.09 ± 0.16 dex larger in the BPASS models. The BPASS models have greater optical depths of dust attenuation (by 0.14 on average) and redder attenuation curves (<δ> ≈ 0.5). This may indicate that the BPASS spectra still have bluer continua than EMILES.

Turning to the spiral, here are the same two sets of plots for UGC 8162. Again the model SFR histories overlap and all three indicate roughly constant star formation over cosmic history with perhaps even slightly increasing in the outskirts. Close examination of the mass growth histories show the MaStar based models favor slightly faster early time growth than the other two.

Although less obvious many of the BPASS model runs have a distinct spike in SFR at the same 1.6 Gyr as in the post-starburst models. In some of the spectra it’s strong enough to contribute significantly to the present day stellar mass. A few other peculiarities are harder to see. The MaStar models invariably have an upturn in SFR at the youngest age (1 Myr), while the BPASS models invariably have a sharp downturn at the youngest age.

There are similar offsets in stellar mass densities to the post-starburst, by 0.12 and -0.14 ± 0.03 in MaStar and BPASS respectively. The mean star formation rate densities are 0.25 ± 0.05 larger in MaSTar but 0.18 ± 0.07 dex lower in BPASS models.

For a quick look at how these model differences affect some key relationships, below are plots of the mean (and standard deviations) of SFR density against stellar mass density (L) and SFR density vs. Hα luminosity density (corrected for estimated stellar attenuation only) (R) for UGC 8162. The straight lines are my estimate of the “star forming main sequence”, and on the right the Calzetti calibration. As expected the models straddle the “spatially resolved star forming main sequence” that I estimated previously5data for this galaxy was recently downloaded and the model results did not contribute to my estimate.. Note that since the BPASS model estimates of both SFR and stellar mass are offset by similar amounts from the EMILES based models the points are just shifted downward along essentially the same relationship. The MaStar based models appear slightly offset to higher SFR values at a given stellar mass density. On the right, there’s a fairly clear stratification. The Hα luminosity estimates are nearly identical for all 3 sets of model runs, so we see the differences in SFR density estimates.

I don’t really have an explanation for the 1.6 Gyr spike in the BPASS models. It’s happening in the two super-solar metallicity bins, which track with the others for both younger and older ages. One possible clue is in their spectra. The broad absorption features in the red are considerably deeper at both younger and older ages6spectra are offset vertically for clarity.. Whether there’s a physical reason for this and why it would affect the models as seen is unknown to me.

BPASS SSP model spectra for Z=0.03, T=1.25, 1.6, and 2 Gyr

There are some notable differences among the libraries at young ages at well. Below are plotted the 1 and 10 Myr spectra at solar metallicity for the 3 libraries. The “EMILES” spectra recall are from the theoretical PyPopstar models with continuum emission included. The BPASS model is considerably more luminous and bluer at 1Myr than the other two and evolves more rapidly at young ages. Is this because I chose the models with 300 M upper mass limit? I should find out.

BPASS, EMILES, and MaStar solar metallicity model spectra at 1Myr
BPASS, EMILES, and MaStar solar metallicity model spectra at 10 Myr

To conclude for now I haven’t quite decided what to replace my existing EMILES models with, if anything. I’m still optimistic that an MaStar based library is the way to go, but the version published by Sanchez et al. isn’t quite ready for production use. I may consider developing my own library although it’s outside the scope of my interests.

MaNGA in M31 – final word for now and some pitfalls

When I was doing my initial fits to the M31 MaNGA spectra I noted two that I initially thought were contaminated by foreground stars, and therefore I masked them to prevent further analysis. One of the two, in MaNGA plateifu 9677-12701 (mangaid 52-8) is a certain foreground star and won’t be discussed further. The other, in plateifu 9678-12703 (mangaid 52-23), turns out to be a luminous red supergiant that’s a genuine resident of M31. This is confirmed by two nearly contemporaneous catalogs of M31 red supergiants: the one by Ren et al. (2021) that I noted previously and Massey et al. (2021), which I stumbled upon more recently.

In fact there are two comparably bright red supergiants in this IFU. One that’s about 9″ north of the masked one probably should have been masked by whatever criteria I used, but it’s likely I failed to notice the fit to the data since I don’t have the patience to look at every spectrum and data fit that pops up. So, here is the spectrum, displayed in (negative) magnitudes with arbitrary zero point. The blue spectrum is the closest match in my SSP library, a 10 Myr old population with the highest metallicity I used (2.5 Z. This is one of the theoretical spectra from PyPopstar). This sorta looks right except it’s much too blue. The solution to that is, of course, to add some reddening through dust attenuation.

plateifu 9678-12703 (M31 10 kpc ring) Spectrum contaminated with red supergiant and closest match SSP model spectrum

The maximum likelihood (non-negative weighted least squares) fit did just that, with only a single stellar contributor and a very high dust attenuation of τV = 3.0. This still doesn’t quite work: the residuals are rather strongly sloped in the blue and the details of the absorption features in the red aren’t quite right.

plateifu 9678-12703 – NNLS fit to spectrum contaminated with red supergiant

I still use a Calzetti attenuation relation in my NNLS fits. The Bayesian fits using Stan have the more flexible attenuation prescription that I described back in this post, and that helped considerably with the continuum as seen in the plot below. The absorption features in the red still aren’t fit well. The model has an even more extreme attenuation estimate with a much “grayer” than Calzetti slope, with τV = 4.38 ± 0.05 and δ = -0.33 ± 0.011see the link above for the meaning of these parameters.

9678-12703 – posterior predictive fit to spectrum

The model star formation history (displayed as a mass growth history below) isn’t completely implausible. The presence of a very luminous evolved star indicates the region is at least some Myr old, and a rapid onset and decline of star formation is typical for star forming regions in mature spirals. The recent episode of star formation added about 7% to the present day stellar mass, while at least 60% was in place by 8 Gyr ago (per the model).

plateifu 9678-12703 (M31 10 kpc ring) Model mass growth history for a region containing a bright red supergiant

Nevertheless I consider the model results to be highly suspect, mainly based on the very large optical depth. Massey estimates the attenuation for the red supergiant to be AV ≈ 1.18, although this is apparently based on a formula rather than a direct empirical estimate. But another piece of evidence that’s close to a smoking gun is an estimate based on the Balmer decrement of emission. Despite the lack of apparent ionizing sources outside the bright H II regions in the west there is widespread diffuse emission in this region with star-forming like line ratios. The estimated optical depth derived from the Balmer decrement for this region is τV, bd = 1.47 ± 0.23 (1σ), reasonably consistent with the Massey estimate and with the values derived for the rest of the IFU.

How widespread a problem is this? Below I plot the Balmer decrement derived optical depths against the stellar based estimates for all spectra in M31 MaNGA with star forming emission line ratios, about 11% of the entire sample. The 5 most extreme outliers are in this IFU in the regions surrounding the two bright red supergiants (the masked spectrum would also be in this region of the plot). The same 5 regions are also extreme outliers in the SFR vs. stellar mass and SFR vs. Hα plots that I showed early on. So, even though there are many cataloged supergiants in the study region these two appear to be uniquely bright and to have had the largest impact on model results.

M31 MaNGA – Optical depth off attenuation estimated from Balmer decrement vs. model values of τV

Of course there are hotter bright stars in the study area and these could affect results in different and possibly unexpected ways. For example the outermost IFU contains one bright star that GAIA estimates has a surface temperature of 5500 C, which would make it a G supergiant if it’s in M31. I noted in the last post that the model star formation history for that region looks like a post starburst with an age around 800 Myr. This is, I think, several galactic rotation periods, and stars born that long ago should have dispersed by now unless they’re gravitationally bound. There’s no sign of a star cluster there nor is there a cataloged one nearby, so it seems likely to me that the “starburst” is an artifact. As I noted in the last post though the fit to the data is quite good.

These examples illustrate an issue that’s fairly well known, which is that using simple stellar populations as building blocks of low mass stellar systems are potentially affected by so-called “stochastic” effects, which simply means that the distribution of stellar masses can vary randomly from what’s assumed in the SSP models. Specifically, in M31 there are individual stars luminous enough to affect spectra. One possible solution might be to add some stellar spectra to the library. I might give that a try some day.

I’m going away and won’t be writing for a while. I’m hoping to acquire or build a SSP model library based on SDSS MaStar spectra yet this year. This is a much larger collection of stellar spectra than has been previously available and it has the advantage of having the same flux calibration and (approximately) spectral resolution as the SDSS galaxy spectra. I also plan to return to my study of post-starburst galaxies.