.. DO NOT EDIT.
.. THIS FILE WAS AUTOMATICALLY GENERATED BY SPHINX-GALLERY.
.. TO MAKE CHANGES, EDIT THE SOURCE PYTHON FILE:
.. "auto_examples/bayesian_swissmetro/plot_b15_panel_discrete.py"
.. LINE NUMBERS ARE GIVEN BELOW.

.. only:: html

    .. note::
        :class: sphx-glr-download-link-note

        :ref:`Go to the end <sphx_glr_download_auto_examples_bayesian_swissmetro_plot_b15_panel_discrete.py>`
        to download the full example code.

.. rst-class:: sphx-glr-example-title

.. _sphx_glr_auto_examples_bayesian_swissmetro_plot_b15_panel_discrete.py:


.. _plot_b15_panel_discrete:

15. Discrete mixture with panel data
====================================

Bayesian estimation of a discrete mixture of logit models, also called a
latent class model. The data file is organized as panel data.

Michel Bierlaire, EPFL
Sat Nov 15 2025, 17:39:13

.. GENERATED FROM PYTHON SOURCE LINES 15-29

.. code-block:: Python

    import biogeme.biogeme_logging as blog
    from IPython.core.display_functions import display
    from biogeme.bayesian_estimation import BayesianResults, get_pandas_estimated_parameters
    from biogeme.biogeme import BIOGEME
    from biogeme.expressions import (
        Beta,
        DistributedParameter,
        Draws,
        exp,
        log,
    )
    from biogeme.models import logit

.. GENERATED FROM PYTHON SOURCE LINES 30-31

See the data processing script: :ref:`swissmetro_panel`.

.. GENERATED FROM PYTHON SOURCE LINES 31-48

.. code-block:: Python

    from swissmetro_panel import (
        CAR_AV_SP,
        CAR_CO_SCALED,
        CAR_TT_SCALED,
        CHOICE,
        SM_AV,
        SM_COST_SCALED,
        SM_TT_SCALED,
        TRAIN_AV_SP,
        TRAIN_COST_SCALED,
        TRAIN_TT_SCALED,
        database,
    )

    logger = blog.get_screen_logger(level=blog.INFO)
    logger.info('Example b15_panel_discrete.py')

.. rst-class:: sphx-glr-script-out

.. code-block:: none

    Example b15_panel_discrete.py

.. GENERATED FROM PYTHON SOURCE LINES 49-50

Parameters to be estimated. One version for each latent class.

.. GENERATED FROM PYTHON SOURCE LINES 50-53

.. code-block:: Python

    NUMBER_OF_CLASSES = 2
    b_cost = [Beta(f'b_cost_class{i}', 0, None, None, 0) for i in range(NUMBER_OF_CLASSES)]

.. GENERATED FROM PYTHON SOURCE LINES 54-56

Define a random parameter, normally distributed across individuals.

.. GENERATED FROM PYTHON SOURCE LINES 56-58

.. code-block:: Python

    b_time = [Beta(f'b_time_class{i}', 0, None, None, 0) for i in range(NUMBER_OF_CLASSES)]

.. GENERATED FROM PYTHON SOURCE LINES 59-60

It is advised not to use 0 as a starting value for the following parameter.

.. GENERATED FROM PYTHON SOURCE LINES 60-71

.. code-block:: Python

    b_time_s = [
        Beta(f'b_time_s_class{i}', 1, None, None, 0) for i in range(NUMBER_OF_CLASSES)
    ]
    b_time_rnd = [
        DistributedParameter(
            f'b_time_rnd_class{i}',
            b_time[i] + b_time_s[i] * Draws(f'b_time_eps_class{i}', 'NORMAL'),
        )
        for i in range(NUMBER_OF_CLASSES)
    ]
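For readers who prefer the algebra, the distributed parameter defined above
corresponds to the following specification (a sketch; it assumes, as is usual
for panel data, that one normal draw :math:`\xi_{n,s}` is generated per
individual :math:`n` and latent class :math:`s`):

.. math::

    \beta^{\mathrm{time}}_{n,s} = b^{\mathrm{time}}_{s}
    + \sigma^{\mathrm{time}}_{s} \, \xi_{n,s},
    \qquad \xi_{n,s} \sim N(0, 1),

where :math:`b^{\mathrm{time}}_{s}` and :math:`\sigma^{\mathrm{time}}_{s}`
correspond to the parameters ``b_time_class{s}`` and ``b_time_s_class{s}``.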
.. GENERATED FROM PYTHON SOURCE LINES 72-73

We do the same for the constants, to address serial correlation.

.. GENERATED FROM PYTHON SOURCE LINES 73-113

.. code-block:: Python

    asc_car = [
        Beta(f'asc_car_class{i}', 0, None, None, 0) for i in range(NUMBER_OF_CLASSES)
    ]
    asc_car_s = [
        Beta(f'asc_car_s_class{i}', 1, None, None, 0) for i in range(NUMBER_OF_CLASSES)
    ]
    asc_car_rnd = [
        DistributedParameter(
            f'asc_car_rnd_class{i}',
            asc_car[i] + asc_car_s[i] * Draws(f'asc_car_eps_class{i}', 'NORMAL'),
        )
        for i in range(NUMBER_OF_CLASSES)
    ]
    asc_train = [
        Beta(f'asc_train_class{i}', 0, None, None, 0) for i in range(NUMBER_OF_CLASSES)
    ]
    asc_train_s = [
        Beta(f'asc_train_s_class{i}', 1, None, None, 0) for i in range(NUMBER_OF_CLASSES)
    ]
    asc_train_rnd = [
        DistributedParameter(
            f'asc_train_rnd_class{i}',
            asc_train[i] + asc_train_s[i] * Draws(f'asc_train_eps_class{i}', 'NORMAL'),
        )
        for i in range(NUMBER_OF_CLASSES)
    ]
    asc_sm = [Beta(f'asc_sm_class{i}', 0, None, None, 1) for i in range(NUMBER_OF_CLASSES)]
    asc_sm_s = [
        Beta(f'asc_sm_s_class{i}', 1, None, None, 0) for i in range(NUMBER_OF_CLASSES)
    ]
    asc_sm_rnd = [
        DistributedParameter(
            f'asc_sm_rnd_class{i}',
            asc_sm[i] + asc_sm_s[i] * Draws(f'asc_sm_eps_class{i}', 'NORMAL'),
        )
        for i in range(NUMBER_OF_CLASSES)
    ]

.. GENERATED FROM PYTHON SOURCE LINES 114-115

Class membership probability. Note that, for Bayesian estimation, this should
not call the logit model.

.. GENERATED FROM PYTHON SOURCE LINES 115-119

.. code-block:: Python

    score_class_0 = Beta('score_class_0', -1.7, None, None, 0)
    probability_class_1 = 1 / (1 + exp(score_class_0))
    probability_class_0 = 1 - probability_class_1

.. GENERATED FROM PYTHON SOURCE LINES 120-121

In class 0, it is assumed that the time coefficient is zero.

.. GENERATED FROM PYTHON SOURCE LINES 121-123

.. code-block:: Python

    b_time_rnd[0] = 0

.. GENERATED FROM PYTHON SOURCE LINES 124-125

Utility functions.

.. GENERATED FROM PYTHON SOURCE LINES 125-142

.. code-block:: Python

    v_train_per_class = [
        asc_train_rnd[i] + b_time_rnd[i] * TRAIN_TT_SCALED + b_cost[i] * TRAIN_COST_SCALED
        for i in range(NUMBER_OF_CLASSES)
    ]
    v_swissmetro_per_class = [
        asc_sm_rnd[i] + b_time_rnd[i] * SM_TT_SCALED + b_cost[i] * SM_COST_SCALED
        for i in range(NUMBER_OF_CLASSES)
    ]
    v_car_per_class = [
        asc_car_rnd[i] + b_time_rnd[i] * CAR_TT_SCALED + b_cost[i] * CAR_CO_SCALED
        for i in range(NUMBER_OF_CLASSES)
    ]
    v_per_class = [
        {1: v_train_per_class[i], 2: v_swissmetro_per_class[i], 3: v_car_per_class[i]}
        for i in range(NUMBER_OF_CLASSES)
    ]

.. GENERATED FROM PYTHON SOURCE LINES 143-144

Associate the availability conditions with the alternatives.

.. GENERATED FROM PYTHON SOURCE LINES 144-146

.. code-block:: Python

    av = {1: TRAIN_AV_SP, 2: SM_AV, 3: CAR_AV_SP}

.. GENERATED FROM PYTHON SOURCE LINES 147-149

The choice model is a discrete mixture of logit models, with availability
conditions. We calculate the conditional choice probability for each class.

.. GENERATED FROM PYTHON SOURCE LINES 149-153

.. code-block:: Python

    conditional_probability_per_class = [
        logit(v_per_class[i], av, CHOICE) for i in range(NUMBER_OF_CLASSES)
    ]

.. GENERATED FROM PYTHON SOURCE LINES 154-155

Conditional on the random variables, this is the likelihood for the individual.

.. GENERATED FROM PYTHON SOURCE LINES 155-160

.. code-block:: Python

    conditional_choice_probability = (
        probability_class_0 * conditional_probability_per_class[0]
        + probability_class_1 * conditional_probability_per_class[1]
    )
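Written out, conditional on the individual-specific random parameters
:math:`\beta_n`, the expression above is the latent class mixture (a sketch of
the quantity assembled by the code, with :math:`s_0` denoting ``score_class_0``):

.. math::

    P_n(i) = \pi_0 \, P_{\mathrm{logit}}(i \mid \beta_{n,0})
    + \pi_1 \, P_{\mathrm{logit}}(i \mid \beta_{n,1}),
    \qquad
    \pi_1 = \frac{1}{1 + e^{s_0}}, \quad \pi_0 = 1 - \pi_1.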
.. GENERATED FROM PYTHON SOURCE LINES 161-162

We need the log probability per observation.

.. GENERATED FROM PYTHON SOURCE LINES 162-164

.. code-block:: Python

    conditional_log_probability = log(conditional_choice_probability)

.. GENERATED FROM PYTHON SOURCE LINES 165-174

.. code-block:: Python

    the_biogeme = BIOGEME(
        database,
        conditional_log_probability,
        warmup=4000,
        bayesian_draws=4000,
        chains=4,
    )
    the_biogeme.model_name = 'b15_panel_discrete'

.. rst-class:: sphx-glr-script-out

.. code-block:: none

    Default values of the Biogeme parameters are used.
    File biogeme.toml has been created

.. GENERATED FROM PYTHON SOURCE LINES 175-176

Estimate the parameters.

.. GENERATED FROM PYTHON SOURCE LINES 176-183

.. code-block:: Python

    try:
        results = BayesianResults.from_netcdf(
            filename=f'saved_results/{the_biogeme.model_name}.nc'
        )
    except FileNotFoundError:
        results = the_biogeme.bayesian_estimation()

.. rst-class:: sphx-glr-script-out

.. code-block:: none

    Loaded NetCDF file size: 2.6 GB
    load finished in 19110 ms (19.11 s)

.. GENERATED FROM PYTHON SOURCE LINES 184-186

.. code-block:: Python

    print(results.short_summary())

.. rst-class:: sphx-glr-script-out

.. code-block:: none

    posterior_predictive_loglike finished in 77 ms
    /Users/bierlair/python_envs/venv313/lib/python3.13/site-packages/arviz/stats/stats.py:1667: UserWarning: For one or more samples the posterior variance of the log predictive densities exceeds 0.4. This could be indication of WAIC starting to fail. See http://arxiv.org/abs/1507.04544 for details
      warnings.warn(
    waic_res finished in 136 ms
    waic finished in 136 ms
    /Users/bierlair/python_envs/venv313/lib/python3.13/site-packages/arviz/stats/stats.py:797: UserWarning: Estimated shape parameter of Pareto distribution is greater than 0.70 for one or more samples. You should consider using a more robust model, this is because importance sampling is less likely to work well if the marginal posterior and LOO posterior are very different. This is more likely to happen with a non-robust model and highly influential observations.
      warnings.warn(
    loo_res finished in 33527 ms (33.53 s)
    loo finished in 33527 ms (33.53 s)
    Sample size                                               6768
    Sampler                                                   NUTS
    Number of chains                                          4
    Number of draws per chain                                 4000
    Total number of draws                                     16000
    Acceptance rate target                                    0.9
    Run time                                                  0:21:03.687596
    Posterior predictive log-likelihood (sum of log mean p)  -2156.92
    Expected log-likelihood E[log L(Y|θ)]                     -2342.51
    Best-draw log-likelihood (posterior upper bound)          -2227.53
    WAIC (Widely Applicable Information Criterion)            -2724.85
    WAIC Standard Error                                       73.52
    Effective number of parameters (p_WAIC)                   567.93
    LOO (Leave-One-Out Cross-Validation)                      -3043.75
    LOO Standard Error                                        78.29
    Effective number of parameters (p_LOO)                    886.83
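As a quick sanity check on these figures (this only rearranges numbers printed
above, and assumes that WAIC and LOO are reported on the log, i.e. elpd, scale,
which is the ArviZ default):

.. math::

    \widehat{\mathrm{elpd}}_{\mathrm{WAIC}}
    = \mathrm{lppd} - p_{\mathrm{WAIC}}
    \approx -2156.92 - 567.93 = -2724.85,
    \qquad
    \widehat{\mathrm{elpd}}_{\mathrm{LOO}}
    = \mathrm{lppd} - p_{\mathrm{LOO}}
    \approx -2156.92 - 886.83 = -3043.75,

where :math:`\mathrm{lppd}` is the posterior predictive log-likelihood, that is,
the sum over observations of the log of the mean predictive density. The two
ArviZ warnings (high variance of the log predictive densities, large Pareto
:math:`\hat{k}` values) indicate that these criteria should be interpreted with
caution for this model.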
.. GENERATED FROM PYTHON SOURCE LINES 187-189

.. code-block:: Python

    pandas_results = get_pandas_estimated_parameters(estimation_results=results)
    display(pandas_results)

.. rst-class:: sphx-glr-script-out

.. code-block:: none

    Diagnostics computation took 417.0 seconds (cached).
                      Name  Value (mean)  ...   ESS (bulk)    ESS (tail)
    0        score_class_0     -5.286779  ...  1285.328326   1331.441053
    1     asc_train_class0     -2.573163  ...  4465.186224   9404.567724
    2   asc_train_s_class0      0.779934  ...  5687.738337  10422.160967
    3        b_cost_class0      2.754704  ...  2355.876493   1338.078191
    4      asc_sm_s_class0      1.412036  ...  2210.169570   5602.051806
    5       asc_car_class0     -0.832146  ...  3880.861502   8316.122106
    6     asc_car_s_class0      0.965673  ...  4882.557850  10246.075254
    7     asc_train_class1     -0.308288  ...  1416.240418   2993.273341
    8   asc_train_s_class1     -1.060029  ...     7.289857     28.262154
    9        b_time_class1     -6.523038  ...  1834.577879   2143.767214
    10     b_time_s_class1      4.033149  ...  1815.166833   2039.288905
    11       b_cost_class1     -4.217157  ...  2072.092115   2448.375181
    12     asc_sm_s_class1      0.148560  ...    25.929931    422.769703
    13      asc_car_class1      0.509266  ...  3064.176039   5047.972557
    14    asc_car_s_class1      2.017802  ...      7.145398     26.944499

    [15 rows x 12 columns]

.. rst-class:: sphx-glr-timing

   **Total running time of the script:** (7 minutes 50.703 seconds)


.. _sphx_glr_download_auto_examples_bayesian_swissmetro_plot_b15_panel_discrete.py:

.. only:: html

  .. container:: sphx-glr-footer sphx-glr-footer-example

    .. container:: sphx-glr-download sphx-glr-download-jupyter

      :download:`Download Jupyter notebook: plot_b15_panel_discrete.ipynb <plot_b15_panel_discrete.ipynb>`

    .. container:: sphx-glr-download sphx-glr-download-python

      :download:`Download Python source code: plot_b15_panel_discrete.py <plot_b15_panel_discrete.py>`

    .. container:: sphx-glr-download sphx-glr-download-zip

      :download:`Download zipped: plot_b15_panel_discrete.zip <plot_b15_panel_discrete.zip>`

.. only:: html

 .. rst-class:: sphx-glr-signature

    `Gallery generated by Sphinx-Gallery <https://sphinx-gallery.github.io>`_