.. DO NOT EDIT.
.. THIS FILE WAS AUTOMATICALLY GENERATED BY SPHINX-GALLERY.
.. TO MAKE CHANGES, EDIT THE SOURCE PYTHON FILE:
.. "auto_examples/swissmetro/plot_b16panel_discrete_socio_eco.py"
.. LINE NUMBERS ARE GIVEN BELOW.

.. only:: html

    .. note::
        :class: sphx-glr-download-link-note

        :ref:`Go to the end <sphx_glr_download_auto_examples_swissmetro_plot_b16panel_discrete_socio_eco.py>`
        to download the full example code.

.. rst-class:: sphx-glr-example-title

.. _sphx_glr_auto_examples_swissmetro_plot_b16panel_discrete_socio_eco.py:


Discrete mixture with panel data
================================

Example of a discrete mixture of logit models, also called a latent class model.
The class membership model includes socio-economic variables.
The data file is organized as panel data.

:author: Michel Bierlaire, EPFL
:date: Mon Apr 10 12:07:15 2023

.. GENERATED FROM PYTHON SOURCE LINES 14-31

.. code-block:: Python


    import biogeme.biogeme_logging as blog
    import biogeme.biogeme as bio
    from biogeme import models

    from biogeme.expressions import (
        Beta,
        Variable,
        bioDraws,
        MonteCarlo,
        log,
        exp,
        bioMultSum,
        ExpressionOrNumeric,
    )
    from biogeme.parameters import Parameters


.. GENERATED FROM PYTHON SOURCE LINES 32-33

See the data processing script: :ref:`swissmetro_panel`.

.. GENERATED FROM PYTHON SOURCE LINES 33-44

.. code-block:: Python

    from swissmetro_panel import (
        flat_database,
        SM_AV,
        CAR_AV_SP,
        TRAIN_AV_SP,
        INCOME,
    )

    logger = blog.get_screen_logger(level=blog.INFO)
    logger.info('Example b16panel_discrete_socio_eco.py')


.. rst-class:: sphx-glr-script-out

 .. code-block:: none

    Example b16panel_discrete_socio_eco.py 


.. GENERATED FROM PYTHON SOURCE LINES 45-46

Parameters to be estimated. One version for each latent class.

.. GENERATED FROM PYTHON SOURCE LINES 46-49

.. code-block:: Python

    NUMBER_OF_CLASSES = 2
    B_COST = [Beta(f'B_COST_class{i}', 0, None, None, 0) for i in range(NUMBER_OF_CLASSES)]


.. GENERATED FROM PYTHON SOURCE LINES 50-52

Define a random parameter, normally distributed across individuals,
designed to be used for Monte-Carlo simulation. The following parameters
are the means of that distribution, one for each class.

.. GENERATED FROM PYTHON SOURCE LINES 52-54

.. code-block:: Python

    B_TIME = [Beta(f'B_TIME_class{i}', 0, None, None, 0) for i in range(NUMBER_OF_CLASSES)]


.. GENERATED FROM PYTHON SOURCE LINES 55-56

It is advised not to use 0 as a starting value for the following parameter,
which scales the random draws.

.. GENERATED FROM PYTHON SOURCE LINES 56-64

.. code-block:: Python

    B_TIME_S = [
        Beta(f'B_TIME_S_class{i}', 1, None, None, 0) for i in range(NUMBER_OF_CLASSES)
    ]
    B_TIME_RND: list[ExpressionOrNumeric] = [
        B_TIME[i] + B_TIME_S[i] * bioDraws(f'B_TIME_RND_class{i}', 'NORMAL_ANTI')
        for i in range(NUMBER_OF_CLASSES)
    ]
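
The ``'NORMAL_ANTI'`` draw type refers to antithetic normal draws. As a rough
illustration of the idea, and not of Biogeme's internal generator, the
following standard-library sketch pairs each draw with its negative and builds
one realization of the random coefficient per draw. The helper
``antithetic_normal_draws`` and the values of the mean and of the scale are
hypothetical.

.. code-block:: Python

    import random


    def antithetic_normal_draws(number_of_draws, seed):
        """Return N(0, 1) draws where each draw is paired with its negative.

        Hypothetical helper for illustration; Biogeme generates its own draws.
        """
        if number_of_draws % 2 != 0:
            raise ValueError('number_of_draws must be even')
        rng = random.Random(seed)
        half = [rng.gauss(0.0, 1.0) for _ in range(number_of_draws // 2)]
        return half + [-xi for xi in half]


    # Illustrative values only, not estimates.
    b_time_mean = -1.0
    b_time_scale = 0.5

    draws = antithetic_normal_draws(number_of_draws=100, seed=1223)
    # One realization of the random coefficient per draw: mean + scale * draw.
    b_time_rnd = [b_time_mean + b_time_scale * xi for xi in draws]

    sample_mean = sum(b_time_rnd) / len(b_time_rnd)
    print(f'Sample mean of the random coefficient: {sample_mean:.6f}')

Because the draws come in pairs of opposite sign, their sample mean is zero up
to rounding, so the sample mean of the coefficient matches ``b_time_mean``.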


.. GENERATED FROM PYTHON SOURCE LINES 65-66

We do the same for the alternative specific constants. Their random terms
are shared by all observations of an individual, which addresses serial correlation.

.. GENERATED FROM PYTHON SOURCE LINES 66-97

.. code-block:: Python

    ASC_CAR = [
        Beta(f'ASC_CAR_class{i}', 0, None, None, 0) for i in range(NUMBER_OF_CLASSES)
    ]
    ASC_CAR_S = [
        Beta(f'ASC_CAR_S_class{i}', 1, None, None, 0) for i in range(NUMBER_OF_CLASSES)
    ]
    ASC_CAR_RND = [
        ASC_CAR[i] + ASC_CAR_S[i] * bioDraws(f'ASC_CAR_RND_class{i}', 'NORMAL_ANTI')
        for i in range(NUMBER_OF_CLASSES)
    ]

    ASC_TRAIN = [
        Beta(f'ASC_TRAIN_class{i}', 0, None, None, 0) for i in range(NUMBER_OF_CLASSES)
    ]
    ASC_TRAIN_S = [
        Beta(f'ASC_TRAIN_S_class{i}', 1, None, None, 0) for i in range(NUMBER_OF_CLASSES)
    ]
    ASC_TRAIN_RND = [
        ASC_TRAIN[i] + ASC_TRAIN_S[i] * bioDraws(f'ASC_TRAIN_RND_class{i}', 'NORMAL_ANTI')
        for i in range(NUMBER_OF_CLASSES)
    ]

    # The last argument (1) keeps ASC_SM fixed at 0 for normalization.
    ASC_SM = [Beta(f'ASC_SM_class{i}', 0, None, None, 1) for i in range(NUMBER_OF_CLASSES)]
    ASC_SM_S = [
        Beta(f'ASC_SM_S_class{i}', 1, None, None, 0) for i in range(NUMBER_OF_CLASSES)
    ]
    ASC_SM_RND = [
        ASC_SM[i] + ASC_SM_S[i] * bioDraws(f'ASC_SM_RND_class{i}', 'NORMAL_ANTI')
        for i in range(NUMBER_OF_CLASSES)
    ]


.. GENERATED FROM PYTHON SOURCE LINES 98-99

Parameters for the class membership model.

.. GENERATED FROM PYTHON SOURCE LINES 99-102

.. code-block:: Python

    CLASS_CTE = Beta('CLASS_CTE', 0, None, None, 0)
    CLASS_INC = Beta('CLASS_INC', 0, None, None, 0)


.. GENERATED FROM PYTHON SOURCE LINES 103-104

In class 0, the time coefficient is assumed to be zero.

.. GENERATED FROM PYTHON SOURCE LINES 104-106

.. code-block:: Python

    B_TIME_RND[0] = 0


.. GENERATED FROM PYTHON SOURCE LINES 107-108

Utility functions, for each class ``i`` and each of the nine observations ``t``
of an individual.

.. GENERATED FROM PYTHON SOURCE LINES 108-140

.. code-block:: Python

    V1 = [
        [
            ASC_TRAIN_RND[i]
            + B_TIME_RND[i] * Variable(f'{t}_TRAIN_TT_SCALED')
            + B_COST[i] * Variable(f'{t}_TRAIN_COST_SCALED')
            for t in range(1, 10)
        ]
        for i in range(NUMBER_OF_CLASSES)
    ]
    V2 = [
        [
            ASC_SM_RND[i]
            + B_TIME_RND[i] * Variable(f'{t}_SM_TT_SCALED')
            + B_COST[i] * Variable(f'{t}_SM_COST_SCALED')
            for t in range(1, 10)
        ]
        for i in range(NUMBER_OF_CLASSES)
    ]
    V3 = [
        [
            ASC_CAR_RND[i]
            + B_TIME_RND[i] * Variable(f'{t}_CAR_TT_SCALED')
            + B_COST[i] * Variable(f'{t}_CAR_CO_SCALED')
            for t in range(1, 10)
        ]
        for i in range(NUMBER_OF_CLASSES)
    ]
    V = [
        [{1: V1[i][t], 2: V2[i][t], 3: V3[i][t]} for t in range(9)]
        for i in range(NUMBER_OF_CLASSES)
    ]
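
The utilities above rely on a naming convention for the columns of the flat
database. The sketch below only reproduces that convention, assuming that each
row holds the nine observations of one individual; it does not access the data.
The constant ``NUMBER_OF_OBSERVATIONS`` is introduced here for illustration.

.. code-block:: Python

    NUMBER_OF_OBSERVATIONS = 9

    # Column names are numbered from 1 to 9.
    train_time_columns = [
        f'{t}_TRAIN_TT_SCALED' for t in range(1, NUMBER_OF_OBSERVATIONS + 1)
    ]
    # Python lists are indexed from 0, hence the shift by one for the choice.
    choice_columns = [f'{t + 1}_CHOICE' for t in range(NUMBER_OF_OBSERVATIONS)]

    for index, (tt, choice) in enumerate(zip(train_time_columns, choice_columns)):
        print(index, tt, choice)

List index ``t`` therefore corresponds to the columns prefixed with ``t + 1``.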


.. GENERATED FROM PYTHON SOURCE LINES 141-142

Associate the availability conditions with the alternatives.

.. GENERATED FROM PYTHON SOURCE LINES 142-144

.. code-block:: Python

    av = {1: TRAIN_AV_SP, 2: SM_AV, 3: CAR_AV_SP}


.. GENERATED FROM PYTHON SOURCE LINES 145-147

The choice model is a discrete mixture of logit models, with availability conditions.
We calculate the conditional probability of the sequence of choices for each class.

.. GENERATED FROM PYTHON SOURCE LINES 147-156

.. code-block:: Python

    prob = [
        exp(
            bioMultSum(
                [models.loglogit(V[i][t], av, Variable(f'{t+1}_CHOICE')) for t in range(9)]
            )
        )
        for i in range(NUMBER_OF_CLASSES)
    ]
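
The probability of a sequence of choices is the product of the logit
probabilities of each observation, computed above as the exponential of a sum
of logarithms. The following standard-library sketch illustrates that
calculation by hand. The function ``log_logit`` is a hypothetical stand-in for
``models.loglogit``, and the utilities, availabilities and choices are made up.

.. code-block:: Python

    import math


    def log_logit(utilities, availability, chosen):
        """Log of the logit probability of ``chosen`` among available alternatives."""
        log_denominator = math.log(
            sum(math.exp(v) for alt, v in utilities.items() if availability[alt])
        )
        return utilities[chosen] - log_denominator


    # Hypothetical utilities of one individual for three choice situations
    # (the example above uses nine), keyed by alternative 1, 2, 3.
    panel_utilities = [
        {1: -0.5, 2: 0.2, 3: -0.1},
        {1: -0.3, 2: 0.4, 3: 0.0},
        {1: -0.8, 2: 0.1, 3: 0.3},
    ]
    availability = {1: True, 2: True, 3: False}
    choices = [2, 2, 1]

    log_probabilities = [
        log_logit(v, availability, c) for v, c in zip(panel_utilities, choices)
    ]
    # Probability of the whole sequence: exponential of the sum of the logs.
    sequence_probability = math.exp(sum(log_probabilities))
    print(f'Probability of the sequence: {sequence_probability:.4f}')

Summing logarithms before exponentiating avoids multiplying many small numbers
directly.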


.. GENERATED FROM PYTHON SOURCE LINES 157-158

Class membership model. Class 1 is the reference: its utility is normalized to zero.

.. GENERATED FROM PYTHON SOURCE LINES 158-162

.. code-block:: Python

    W = CLASS_CTE + CLASS_INC * INCOME
    PROB_class0 = models.logit({0: W, 1: 0}, None, 0)
    PROB_class1 = models.logit({0: W, 1: 0}, None, 1)
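
To make the class membership model concrete, the following standard-library
sketch evaluates the binary logit by hand, using the estimates of
``CLASS_CTE`` and ``CLASS_INC`` reported in the results table of this example.
The function ``class_probabilities`` and the income codes in the loop are
hypothetical; the actual coding of ``INCOME`` is defined in the data
processing script.

.. code-block:: Python

    import math

    # Estimates reported in the results table of this example.
    CLASS_CTE = -0.639407
    CLASS_INC = -0.260331


    def class_probabilities(income):
        """Binary logit over classes 0 and 1, with class 1 as the reference."""
        w = CLASS_CTE + CLASS_INC * income
        prob_class0 = math.exp(w) / (math.exp(w) + 1.0)
        return prob_class0, 1.0 - prob_class0


    for income in (0, 1, 2, 3):  # hypothetical income codes
        p0, p1 = class_probabilities(income)
        print(f'income={income}: P(class 0)={p0:.3f}, P(class 1)={p1:.3f}')

With a negative ``CLASS_INC``, the probability of belonging to class 0
decreases as the income code increases.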


.. GENERATED FROM PYTHON SOURCE LINES 163-164

Likelihood for the individual, conditional on the random variables.

.. GENERATED FROM PYTHON SOURCE LINES 164-166

.. code-block:: Python

    probIndiv = PROB_class0 * prob[0] + PROB_class1 * prob[1]


.. GENERATED FROM PYTHON SOURCE LINES 167-168

We integrate over the random variables using Monte-Carlo simulation.

.. GENERATED FROM PYTHON SOURCE LINES 168-170

.. code-block:: Python

    logprob = log(MonteCarlo(probIndiv))
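
Monte-Carlo integration replaces the integral by an average over draws. The
standard-library sketch below illustrates the principle on a hypothetical
integrand whose expectation is known in closed form, so that the simulated
value can be compared with the exact one. The functions
``antithetic_normal_draws`` and ``integrand`` are made up for this
illustration and are not part of Biogeme.

.. code-block:: Python

    import math
    import random


    def antithetic_normal_draws(number_of_draws, seed):
        """Hypothetical helper: N(0, 1) draws paired with their negatives."""
        rng = random.Random(seed)
        half = [rng.gauss(0.0, 1.0) for _ in range(number_of_draws // 2)]
        return half + [-xi for xi in half]


    def integrand(xi, mean=-1.0, scale=0.5):
        """Hypothetical integrand, standing in for the conditional likelihood."""
        return math.exp(mean + scale * xi)


    draws = antithetic_normal_draws(number_of_draws=10_000, seed=1223)
    # Monte-Carlo approximation of the integral: average over the draws.
    simulated = sum(integrand(xi) for xi in draws) / len(draws)
    log_simulated = math.log(simulated)

    # Closed form for this integrand: E[exp(m + s * xi)] = exp(m + s**2 / 2).
    log_exact = -1.0 + 0.5**2 / 2
    print(f'simulated: {log_simulated:.4f}, exact: {log_exact:.4f}')

The logarithm is taken after averaging, in the same order as in ``log(MonteCarlo(probIndiv))``.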


.. GENERATED FROM PYTHON SOURCE LINES 171-174

As the objective is to illustrate the
syntax, we calculate the Monte-Carlo approximation with a small
number of draws.

.. GENERATED FROM PYTHON SOURCE LINES 174-177

.. code-block:: Python

    the_biogeme = bio.BIOGEME(flat_database, logprob, number_of_draws=100, seed=1223)
    the_biogeme.modelName = 'b16panel_discrete_socio_eco'


.. rst-class:: sphx-glr-script-out

 .. code-block:: none

    Biogeme parameters read from biogeme.toml. 


.. GENERATED FROM PYTHON SOURCE LINES 178-179

Estimate the parameters.

.. GENERATED FROM PYTHON SOURCE LINES 179-181

.. code-block:: Python

    results = the_biogeme.estimate()


.. rst-class:: sphx-glr-script-out

 .. code-block:: none

    As the model is rather complex, we cancel the calculation of second derivatives. If you want to control the parameters, change the name of the algorithm in the TOML file from "automatic" to "simple_bounds" 
    *** Initial values of the parameters are obtained from the file __b16panel_discrete_socio_eco.iter 
    Cannot read file __b16panel_discrete_socio_eco.iter. Statement is ignored. 
    The number of draws (100) is low. The results may not be meaningful. 
    As the model is rather complex, we cancel the calculation of second derivatives. If you want to control the parameters, change the name of the algorithm in the TOML file from "automatic" to "simple_bounds" 
    Optimization algorithm: hybrid Newton/BFGS with simple bounds [simple_bounds] 
    ** Optimization: BFGS with trust region for simple bounds 
    Iter.     Function    Relgrad   Radius      Rho      
        0        4e+03      0.034        1     0.44    + 
        1      3.8e+03      0.024        1     0.63    + 
        2      3.8e+03      0.024      0.5    0.052    - 
        3      3.7e+03      0.029      0.5     0.52    + 
        4      3.7e+03      0.028      0.5     0.39    + 
        5      3.7e+03      0.032        5     0.97   ++ 
        6      3.7e+03      0.032      2.5       -2    - 
        7      3.6e+03      0.018      2.5     0.51    + 
        8      3.6e+03      0.018      1.2     -2.6    - 
        9      3.6e+03      0.018     0.62     -2.7    - 
       10      3.6e+03      0.018     0.31       -1    - 
       11      3.6e+03      0.018     0.16    0.057    - 
       12      3.6e+03     0.0069     0.16     0.59    + 
       13      3.6e+03      0.011     0.16     0.19    + 
       14      3.6e+03     0.0036     0.16     0.68    + 
       15      3.6e+03     0.0073     0.16     0.58    + 
       16      3.6e+03     0.0029     0.16     0.81    + 
       17      3.6e+03     0.0055     0.16     0.65    + 
       18      3.6e+03     0.0039     0.16     0.36    + 
       19      3.6e+03     0.0045      1.6        1   ++ 
       20      3.6e+03     0.0045     0.78     -2.7    - 
       21      3.6e+03     0.0045     0.39    -0.61    - 
       22      3.6e+03     0.0099     0.39     0.15    + 
       23      3.6e+03     0.0099      0.2   -0.024    - 
       24      3.6e+03     0.0045      0.2     0.46    + 
       25      3.6e+03     0.0038      0.2     0.24    + 
       26      3.5e+03     0.0013      0.2     0.69    + 
       27      3.5e+03     0.0025      0.2     0.39    + 
       28      3.5e+03     0.0015        2     0.96   ++ 
       29      3.5e+03     0.0015     0.98      -25    - 
       30      3.5e+03     0.0015     0.49      -14    - 
       31      3.5e+03     0.0015     0.24       -5    - 
       32      3.5e+03     0.0015     0.12     -2.3    - 
       33      3.5e+03     0.0015    0.061     -1.6    - 
       34      3.5e+03     0.0015    0.031    -0.38    - 
       35      3.5e+03     0.0028    0.031     0.43    + 
       36      3.5e+03     0.0028    0.015     0.02    - 
       37      3.5e+03    0.00093    0.015     0.54    + 
       38      3.5e+03    0.00092    0.015     0.88    + 
       39      3.5e+03    0.00087    0.015     0.89    + 
       40      3.5e+03    0.00091    0.015     0.89    + 
       41      3.5e+03    0.00087     0.15     0.97   ++ 
       42      3.5e+03    0.00092      1.5     0.99   ++ 
       43      3.5e+03    0.00092     0.76    -0.67    - 
       44      3.5e+03     0.0022      7.6      1.1   ++ 
       45      3.5e+03     0.0022      3.1      -26    - 
       46      3.5e+03     0.0022      1.6      -12    - 
       47      3.5e+03     0.0022     0.78     -9.8    - 
       48      3.5e+03     0.0022     0.39     -2.9    - 
       49      3.5e+03     0.0022      0.2    -0.81    - 
       50      3.5e+03     0.0022    0.098    -0.17    - 
       51      3.5e+03      0.002    0.098     0.63    + 
       52      3.5e+03      0.003    0.098      0.4    + 
       53      3.5e+03     0.0013    0.098     0.88    + 
       54      3.5e+03     0.0024    0.098     0.73    + 
       55      3.5e+03    0.00096    0.098      0.9    + 
       56      3.5e+03     0.0013     0.98     0.93   ++ 
       57      3.5e+03     0.0014     0.98     0.49    + 
       58      3.5e+03     0.0014     0.49       -4    - 
       59      3.5e+03     0.0014     0.24       -2    - 
       60      3.5e+03     0.0014     0.12     -1.3    - 
       61      3.5e+03     0.0015     0.12     0.14    + 
       62      3.5e+03     0.0013     0.12     0.43    + 
       63      3.5e+03     0.0013    0.061    -0.83    - 
       64      3.5e+03     0.0011    0.061     0.65    + 
       65      3.5e+03     0.0011    0.031    -0.19    - 
       66      3.5e+03    0.00053    0.031     0.44    + 
       67      3.5e+03    0.00054    0.031     0.27    + 
       68      3.5e+03     0.0004    0.031     0.84    + 
       69      3.5e+03    0.00037    0.031     0.46    + 
       70      3.5e+03    0.00044    0.031     0.26    + 
       71      3.5e+03    0.00033    0.031     0.69    + 
       72      3.5e+03    0.00033    0.015   -0.036    - 
       73      3.5e+03    0.00034    0.015     0.51    + 
       74      3.5e+03    0.00023    0.015     0.89    + 
       75      3.5e+03    0.00025     0.15     0.92   ++ 
       76      3.5e+03    0.00027     0.15     0.77    + 
       77      3.5e+03    0.00027    0.077     -2.9    - 
       78      3.5e+03    0.00027    0.038     -1.8    - 
       79      3.5e+03    0.00027    0.019    -0.59    - 
       80      3.5e+03    0.00027   0.0096    0.083    - 
       81      3.5e+03    0.00022   0.0096      0.8    + 
       82      3.5e+03    0.00022   0.0048    -0.25    - 
       83      3.5e+03    0.00021   0.0048     0.49    + 
       84      3.5e+03    0.00013   0.0048     0.81    + 
       85      3.5e+03    0.00013    0.048     0.96   ++ 
       86      3.5e+03    0.00013     0.48     0.96   ++ 
       87      3.5e+03    0.00013     0.16     -3.2    - 
       88      3.5e+03    0.00013    0.079     -2.2    - 
       89      3.5e+03    0.00013     0.04    -0.84    - 
       90      3.5e+03     0.0001     0.04     0.37    - 
    Results saved in file b16panel_discrete_socio_eco.html 
    Results saved in file b16panel_discrete_socio_eco.pickle 


.. GENERATED FROM PYTHON SOURCE LINES 182-184

.. code-block:: Python

    print(results.short_summary())


.. rst-class:: sphx-glr-script-out

 .. code-block:: none

    Results for model b16panel_discrete_socio_eco
    Nbr of parameters:              16
    Sample size:                    752
    Excluded data:                  0
    Final log likelihood:           -3545.388
    Akaike Information Criterion:   7122.776
    Bayesian Information Criterion: 7196.74
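
As a consistency check, the two information criteria can be recomputed from
the other entries of the summary. The sketch below assumes the standard
definitions, AIC = 2k - 2LL and BIC = k ln(n) - 2LL, where k is the number of
parameters, n the sample size and LL the final log likelihood.

.. code-block:: Python

    import math

    # Values copied from the summary above.
    number_of_parameters = 16
    sample_size = 752
    final_log_likelihood = -3545.388

    aic = 2 * number_of_parameters - 2 * final_log_likelihood
    bic = number_of_parameters * math.log(sample_size) - 2 * final_log_likelihood

    print(f'AIC: {aic:.3f}')  # AIC: 7122.776
    print(f'BIC: {bic:.2f}')  # BIC: 7196.74

Both values agree with the reported criteria.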


.. GENERATED FROM PYTHON SOURCE LINES 185-187

.. code-block:: Python

    pandas_results = results.get_estimated_parameters()
    pandas_results


.. raw:: html

    <div class="output_subarea output_html rendered_html output_result">
    <div>
    <style scoped>
        .dataframe tbody tr th:only-of-type {
            vertical-align: middle;
        }

        .dataframe tbody tr th {
            vertical-align: top;
        }

        .dataframe thead th {
            text-align: right;
        }
    </style>
    <table border="1" class="dataframe">
      <thead>
        <tr style="text-align: right;">
          <th></th>
          <th>Value</th>
          <th>Rob. Std err</th>
          <th>Rob. t-test</th>
          <th>Rob. p-value</th>
        </tr>
      </thead>
      <tbody>
        <tr>
          <th>ASC_CAR_S_class0</th>
          <td>7.897693</td>
          <td>1.267046</td>
          <td>6.233152</td>
          <td>4.571428e-10</td>
        </tr>
        <tr>
          <th>ASC_CAR_S_class1</th>
          <td>2.770215</td>
          <td>0.383546</td>
          <td>7.222647</td>
          <td>5.098144e-13</td>
        </tr>
        <tr>
          <th>ASC_CAR_class0</th>
          <td>-3.932036</td>
          <td>0.778884</td>
          <td>-5.048296</td>
          <td>4.457687e-07</td>
        </tr>
        <tr>
          <th>ASC_CAR_class1</th>
          <td>0.928194</td>
          <td>0.303702</td>
          <td>3.056265</td>
          <td>2.241133e-03</td>
        </tr>
        <tr>
          <th>ASC_SM_S_class0</th>
          <td>1.721648</td>
          <td>0.641441</td>
          <td>2.684031</td>
          <td>7.274033e-03</td>
        </tr>
        <tr>
          <th>ASC_SM_S_class1</th>
          <td>0.787746</td>
          <td>0.346705</td>
          <td>2.272093</td>
          <td>2.308088e-02</td>
        </tr>
        <tr>
          <th>ASC_TRAIN_S_class0</th>
          <td>2.551066</td>
          <td>0.831508</td>
          <td>3.068001</td>
          <td>2.154959e-03</td>
        </tr>
        <tr>
          <th>ASC_TRAIN_S_class1</th>
          <td>2.114956</td>
          <td>0.275435</td>
          <td>7.678595</td>
          <td>1.598721e-14</td>
        </tr>
        <tr>
          <th>ASC_TRAIN_class0</th>
          <td>-1.062234</td>
          <td>0.784735</td>
          <td>-1.353622</td>
          <td>1.758571e-01</td>
        </tr>
        <tr>
          <th>ASC_TRAIN_class1</th>
          <td>-0.619074</td>
          <td>0.328846</td>
          <td>-1.882567</td>
          <td>5.975908e-02</td>
        </tr>
        <tr>
          <th>B_COST_class0</th>
          <td>-1.923801</td>
          <td>0.515813</td>
          <td>-3.729646</td>
          <td>1.917486e-04</td>
        </tr>
        <tr>
          <th>B_COST_class1</th>
          <td>-4.255385</td>
          <td>0.319924</td>
          <td>-13.301230</td>
          <td>0.000000e+00</td>
        </tr>
        <tr>
          <th>B_TIME_S_class1</th>
          <td>2.165545</td>
          <td>0.163520</td>
          <td>13.243302</td>
          <td>0.000000e+00</td>
        </tr>
        <tr>
          <th>B_TIME_class1</th>
          <td>-6.682184</td>
          <td>0.478450</td>
          <td>-13.966314</td>
          <td>0.000000e+00</td>
        </tr>
        <tr>
          <th>CLASS_CTE</th>
          <td>-0.639407</td>
          <td>0.382361</td>
          <td>-1.672259</td>
          <td>9.447322e-02</td>
        </tr>
        <tr>
          <th>CLASS_INC</th>
          <td>-0.260331</td>
          <td>0.172839</td>
          <td>-1.506204</td>
          <td>1.320149e-01</td>
        </tr>
      </tbody>
    </table>
    </div>
    </div>
    <br />
    <br />


.. rst-class:: sphx-glr-timing

   **Total running time of the script:** (12 minutes 21.457 seconds)


.. _sphx_glr_download_auto_examples_swissmetro_plot_b16panel_discrete_socio_eco.py:

.. only:: html

  .. container:: sphx-glr-footer sphx-glr-footer-example

    .. container:: sphx-glr-download sphx-glr-download-jupyter

      :download:`Download Jupyter notebook: plot_b16panel_discrete_socio_eco.ipynb <plot_b16panel_discrete_socio_eco.ipynb>`

    .. container:: sphx-glr-download sphx-glr-download-python

      :download:`Download Python source code: plot_b16panel_discrete_socio_eco.py <plot_b16panel_discrete_socio_eco.py>`

    .. container:: sphx-glr-download sphx-glr-download-zip

      :download:`Download zipped: plot_b16panel_discrete_socio_eco.zip <plot_b16panel_discrete_socio_eco.zip>`


.. only:: html

 .. rst-class:: sphx-glr-signature

    `Gallery generated by Sphinx-Gallery <https://sphinx-gallery.github.io>`_