This page was generated from docs/source/notebooks/quickstart/quickstart.ipynb. Interactive online version:

Import the data into your project

As long as you can read your data into a xarray.Dataset or xarray.DataArray you can directly import it in to your project.

This will save your data as NetCDF (.nc) file into the data folder inside of your project with the name that you gave it (here quickstart_project/data/my_data.nc).

If the data format you are using is supported by a plugin you can simply copy the file to the data folder of the project (here quickstart_project/data).

[6]:

quickstart_project.import_data(my_dataset, dataset_name="my_data")
quickstart_project

[6]:

Project (quickstart_project)

pyglotaran version: 0.7.2

Data

my_data

Model

my_model

Parameters

my_parameters

Results

None

After importing our quickstart_project is aware of the data that we named my_data when importing.

Working with models

To analyze our data, we need to create a model.

Create a file called my_model.yaml in your projects models directory and fill it with the following content.

[9]:

quickstart_project.show_model_definition("my_model")

[9]:

default_megacomplex: decay

initial_concentration:
  input:
    compartments: [s1, s2, s3]
    parameters: [input.1, input.0, input.0]

k_matrix:
  k1:
    matrix:
      (s2, s1): kinetic.1
      (s3, s2): kinetic.2
      (s3, s3): kinetic.3

megacomplex:
  m1:
    k_matrix: [k1]

irf:
  irf1:
    type: gaussian
    center: irf.center
    width: irf.width

dataset:
  my_data:
    initial_concentration: input
    megacomplex: [m1]
    irf: irf1

You can check your model for problems with the validate method.

[10]:

quickstart_project.validate("my_model")

[10]:

Your model is valid.

Working with parameters

Now define some starting parameters. Create a file called parameters.yaml in your projects parameters directory with the following content.

[11]:

quickstart_project.show_parameters_definition("my_parameters")

[11]:

input:
  - ["1", 1, { "vary": False }]
  - ["0", 0, { "vary": False }]

kinetic: [0.51, 0.31, 0.11]

irf:
  - ["center", 0.31]
  - ["width", 0.11]

Note the { "vary": False } which tells pyglotaran that those parameters should not be changed.

You can use validate method also to check for missing parameters.

[12]:

quickstart_project.validate("my_model", "my_parameters")

[12]:

Your model is valid.

Since not all problems in the model can be detected automatically it is wise to visually inspect the model. For this purpose, you can just load the model and inspect its markdown rendered version.

[13]:

quickstart_project.load_model("my_model")

[13]:

Model

Dataset Groups

default
- Label: default
- Residual Function: variable_projection

K Matrix

k1
- Label: k1
- Matrix: {(‘s2’, ‘s1’): ‘kinetic.1’, (‘s3’, ‘s2’): ‘kinetic.2’, (‘s3’, ‘s3’): ‘kinetic.3’}

Megacomplex

m1
- Label: m1
- Dimension: time
- Type: decay
- K Matrix: [‘k1’]

Initial Concentration

input
- Label: input
- Compartments: [‘s1’, ‘s2’, ‘s3’]
- Parameters: [‘input.1’, ‘input.0’, ‘input.0’]
- Exclude From Normalize: []

Irf

irf1
- Label: irf1
- Normalize: True
- Backsweep: False
- Type: gaussian
- Center: irf.center
- Width: irf.width

Dataset

my_data
- Label: my_data
- Group: default
- Force Index Dependent: False
- Megacomplex: [‘m1’]
- Initial Concentration: input
- Irf: irf1

The same way you should inspect your parameters.

[14]:

quickstart_project.load_parameters("my_parameters")

[14]:

input:

Label

Value

Standard Error

t-value

Minimum

Maximum

Vary

Non-Negative

Expression

1

1.000e+00

nan

nan

-inf

inf

False

False

None

0

0.000e+00

nan

nan

-inf

inf

False

False

None
irf:

Label

Value

Standard Error

t-value

Minimum

Maximum

Vary

Non-Negative

Expression

center

3.100e-01

nan

nan

-inf

inf

True

False

None

width

1.100e-01

nan

nan

-inf

inf

True

False

None

kinetic:

Label	Value	Standard Error	t-value	Minimum	Maximum	Vary	Non-Negative	Expression
1	5.100e-01	nan	nan	-inf	inf	True	False	`None`
2	3.100e-01	nan	nan	-inf	inf	True	False	`None`
3	1.100e-01	nan	nan	-inf	inf	True	False	`None`

Optimizing data

Now we have everything together to optimize our parameters.

[15]:

result = quickstart_project.optimize("my_model", "my_parameters")
result

/home/docs/checkouts/readthedocs.org/user_builds/pyglotaran/conda/latest/lib/python3.10/site-packages/glotaran/optimization/data_provider.py:615: UserWarning: The `squeeze` kwarg to GroupBy is being removed.Pass .groupby(..., squeeze=False) to disable squeezing, which is the new default, and to silence this warning.
  ["".join(sub_arr.values) for _, sub_arr in aligned_groups.groupby("global")]

   Iteration     Total nfev        Cost      Cost reduction    Step norm     Optimality
       0              1         1.1176e+04                                    1.73e+06
       1              2         1.4967e+01      1.12e+04       1.96e-02       1.27e+04

       2              3         7.5262e+00      7.44e+00       5.86e-03       1.01e+03
       3              4         7.5223e+00      3.87e-03       1.71e-05       6.56e-02
       4              5         7.5223e+00      1.63e-11       1.86e-09       8.79e-06
Both `ftol` and `xtol` termination conditions are satisfied.
Function evaluations 5, initial cost 1.1176e+04, final cost 7.5223e+00, first-order optimality 8.79e-06.

[15]:

Optimization Result
Number of residual evaluation	5
Number of residuals	151200
Number of free parameters	5
Number of conditionally linear parameters	216
Degrees of freedom	150979
Chi Square	1.50e+01
Reduced Chi Square	9.96e-05
Root Mean Square Error (RMSE)	9.98e-03

Model

Dataset Groups

default
- Label: default
- Residual Function: variable_projection

K Matrix

k1
- Label: k1
- Matrix: {(‘s2’, ‘s1’): ‘kinetic.1(5.00e-01±6.77e-05, t-value: 7380, initial: 5.10e-01)’, (‘s3’, ‘s2’): ‘kinetic.2(3.00e-01±3.93e-05, t-value: 7637, initial: 3.10e-01)’, (‘s3’, ‘s3’): ‘kinetic.3(1.00e-01±4.22e-06, t-value: 23711, initial: 1.10e-01)’}

Megacomplex

m1
- Label: m1
- Dimension: time
- Type: decay
- K Matrix: [‘k1’]

Initial Concentration

input
- Label: input
- Compartments: [‘s1’, ‘s2’, ‘s3’]
- Parameters: [‘input.1(1.00e+00, fixed)’, ‘input.0(0.00e+00, fixed)’, ‘input.0(0.00e+00, fixed)’]
- Exclude From Normalize: []

Irf

irf1
- Label: irf1
- Normalize: True
- Backsweep: False
- Type: gaussian
- Center: irf.center(3.00e-01±5.03e-06, t-value: 59666, initial: 3.10e-01)
- Width: irf.width(1.00e-01±6.71e-06, t-value: 14906, initial: 1.10e-01)

Dataset

my_data
- Label: my_data
- Group: default
- Force Index Dependent: False
- Megacomplex: [‘m1’]
- Initial Concentration: input
- Irf: irf1

Each time you run an optimization the result will be saved in the projects results folder.

[16]:

%ls "quickstart_project/results"

my_model_run_0000/

To visualize how quickly the optimization converged we ca plot the optimality of the optimization_history.

[17]:

result.optimization_history.data["optimality"].plot(logy=True)

[17]:

<Axes: xlabel='iteration'>

../../_images/notebooks_quickstart_quickstart_38_1.svg

[18]:

result.optimized_parameters

[18]:

input:

Label

Value

Standard Error

t-value

Minimum

Maximum

Vary

Non-Negative

Expression

1

1.000e+00

nan

nan

-inf

inf

False

False

None

0

0.000e+00

nan

nan

-inf

inf

False

False

None

irf:

Label	Value	Standard Error	t-value	Minimum	Maximum	Vary	Non-Negative	Expression
center	3.000e-01	5.028e-06	59666	-inf	inf	True	False	`None`
width	1.000e-01	6.709e-06	14906	-inf	inf	True	False	`None`

kinetic:

Label	Value	Standard Error	t-value	Minimum	Maximum	Vary	Non-Negative	Expression
1	4.999e-01	6.774e-05	7380	-inf	inf	True	False	`None`
2	3.000e-01	3.929e-05	7637	-inf	inf	True	False	`None`
3	1.000e-01	4.217e-06	23711	-inf	inf	True	False	`None`

You can inspect the data of your result by accessing data attribute. In our example it only contains our single my_data dataset, but it ca contain as many dataset as you analysis needs.

[19]:

result.data

Visualize the Result

The resulting data can be visualized the same way as the dataset. To judge the quality of the fit, you should look at first left and right singular vectors of the residual.

[20]:

result_dataset = result.data["my_data"]

residual_left = result_dataset.residual_left_singular_vectors.sel(left_singular_value_index=0)
residual_right = result_dataset.residual_right_singular_vectors.sel(right_singular_value_index=0)
residual_left.plot.line(x="time", aspect=2, size=5)
residual_right.plot.line(x="spectral", aspect=2, size=5);

../../_images/notebooks_quickstart_quickstart_43_0.svg

../../_images/notebooks_quickstart_quickstart_43_1.svg

Quickstart/Cheat-Sheet

Project (quickstart_project)

Data

Model

Parameters

Results

Plotting raw data

Import the data into your project

Project (quickstart_project)

Data

Model

Parameters

Results

Preparing data

Working with models

Working with parameters

Model

Dataset Groups

K Matrix

Megacomplex

Initial Concentration

Irf

Dataset

Optimizing data

Model

Dataset Groups

K Matrix

Megacomplex

Initial Concentration

Irf

Dataset

Visualize the Result