LiberTEM - Open Pixelated STEM platform

docs gitter azure github codeclimate precommit joss zenodo pypi condaforge

LiberTEM is an open source platform for high-throughput distributed processing of large-scale binary data sets and live data streams using a modified MapReduce programming model. The current focus is pixelated scanning transmission electron microscopy (STEM) [MNR+16, Oph19] and scanning electron beam diffraction data.

MapReduce-like processing allows to specify an algorithm through two functions: One function that is mapped on portions of the input data, and another function that merges (reduces) a partial result from this mapping step into the complete result. A wide range of TEM and 4D STEM processing tasks can be expressed in this fashion, see Applications.

The UDF interface of LiberTEM offers a standardized, versatile API to decouple the mathematical core of an algorithm from details of data source, parallelism, and use of results. Mapping and merging can be performed in any order and with different subdivisions of the input data, including running parts of the calculation concurrently. That means the same implementation can be used in a wide range of modalities, including massive scaling on clusters. Since each merge step produces an intermediate result, this style of processing is suitable for displaying live results from a running calculation in a GUI application and for processing live data streams. A closed-loop feedback between processing and instrument control can be realized as well. See User-defined functions for more details on the LiberTEM UDF interface.

The LiberTEM back-end offers high throughput and scalability on PCs, single server nodes, clusters and cloud services. On clusters it can use fast distributed local storage on high-performance SSDs. That way it achieves very high aggregate IO performance on a compact and cost-efficient system built from stock components. All CPU cores and CUDA devices in a system can be used in parallel.

LiberTEM is supported on Linux, Mac OS X and Windows. Other platforms that allow installation of Python 3.7+ and the required packages will likely work as well. The GUI is running in a web browser.

Installation

The short version:

$ virtualenv -p python3 ~/libertem-venv/
$ source ~/libertem-venv/bin/activate
(libertem-venv) $ python -m pip install "libertem[torch]"

# optional for GPU support
# See also https://docs.cupy.dev/en/stable/install.html
(libertem-venv) $ python -m pip install cupy

Please see our documentation for details!

Alternatively, to run the LiberTEM Docker image:

$ docker run -p localhost:9000:9000 --mount type=bind,source=/path/to/your/data/,dst=/data/,ro ghcr.io/libertem/libertem

or

$ singularity exec docker://ghcr.io/libertem/libertem /venv/bin/libertem-server

Deployment for offline data processing on a single-node system for a local user is thoroughly tested and can be considered stable. Deployment on a cluster is experimental and still requires some additional work, see Issue #105. Back-end support for live data processing is still experimental as well, see https://github.com/LiberTEM/LiberTEM-live.

Applications

Since LiberTEM is programmable through user-defined functions (UDFs), it can be used for a wide range of processing tasks on array-like data and data streams. The following applications have been implemented already:

Some of these applications are available through an interactive web GUI. Please see the applications section of our documentation for details!

The Python API and user-defined functions (UDFs) can be used for complex operations such as arbitrary linear operations and other features like data export. Example Jupyter notebooks are available in the examples directory. If you are having trouble running the examples, please let us know by filing an issue or by joining our Gitter chat.

LiberTEM is suitable as a high-performance processing backend for other applications, including live data streams. Contact us if you are interested!

LiberTEM is evolving rapidly and prioritizes features following user demand and contributions. Currently we are working on live data processing, improving application support for sparse data and event-based detectors, performance improvements for GPU processing, and implementing analysis methods for various applications of pixelated STEM and other large-scale detector data. If you like to influence the direction this project is taking, or if you’d like to contribute, please join our gitter chat and our general mailing list.

File formats

LiberTEM currently opens most file formats used for pixelated STEM. See our general information on loading data and format-specific documentation for more information!

  • Raw binary files

  • NumPy .npy binary files

  • Thermo Fisher EMPAD detector [TPC+16] files

  • Quantum Detectors MIB format

  • Nanomegas .blo block files

  • Direct Electron DE5 files (HDF5-based) and Norpix SEQ files for DE-Series detectors

  • Gatan K2 IS raw format

  • Stacks of Gatan DM3 and DM4 files (via openNCEM)

  • Single-file Gatan DM4 scans when saved using C-ordering

  • FRMS6 from PNDetector pnCCD cameras [SRB+15] (currently alpha, gain correction still needs UI changes)

  • FEI SER files (via openNCEM)

  • MRC (via openNCEM)

  • HDF5-based formats such as HyperSpy files, NeXus and EMD

  • TVIPS binary files

  • Sparse data in Raw CSR (compressed sparse row) format, as is possible to generate from event-based detectors

  • Please contact us if you are interested in support for an additional format!

Live processing and detectors (experimental)

See LiberTEM-live!

License

LiberTEM is licensed under GPLv3. The I/O parts are also available under the MIT license, please see LICENSE files in the subdirectories for details.

Acknowledgements

We are very grateful for your continuing support for LiberTEM!

See the acknowledgement page for a list of authors and contributors to LiberTEM and its subprojects. See also our info on funding and industry partners.

Documentation

User manual

Indices and tables

Citations

[GT97]

J. M. Gibson and M. M. J. Treacy. Diminished medium-range order observed in annealed amorphous germanium. Physical Review Letters, 78(6):1074–1077, feb 1997. doi:10.1103/physrevlett.78.1074.

[GGM+19a]

Giulio Guzzinati, Wannes Ghielens, Christoph Mahr, Armand Béché, Andreas Rosenauer, Toon Calders, and Jo Verbeeck. Electron bessel beam diffraction for precise and accurate nanoscale strain mapping. Applied Physics Letters, 2019. arXiv:1902.06979v3, doi:10.1063/1.5096245.

[GGM+19b]

Giulio Guzzinati, Wannes Ghielens, Christoph Mahr, Armand Béché, Andreas Rosenauer, Toon Calders, and Jo Verbeeck. Electron Bessel beam diffraction patterns, line scan of Si/SiGe multilayer. February 2019. URL: https://doi.org/10.5281/zenodo.2566137, doi:10.5281/zenodo.2566137.

[KMM+16]

Matus Krajnak, Damien McGrouther, Dzmitry Maneuski, Val O\textquotesingle Shea, and Stephen McVitie. Pixelated detectors and improved efficiency for magnetic imaging in STEM differential phase contrast. Ultramicroscopy, 165:42–50, jun 2016. doi:10.1016/j.ultramic.2016.03.006.

[LL08]

H. Lichte and M. Lehmann. Electron holography–basics and applications. Rep. Prog. Phys., 71:016102, 2008. doi:10.1088/0034-4885/71/1/016102.

[MNR+16]

Ian MacLaren, Magnus Nord, Andrew Ross, Matus Krajnak, Martin Hart, Alastair Doye, Damien McGrouther, Rantej Bali, Archan Banerjee, and Robert Hadfield. Pixelated STEM detectors: opportunities and challenges, pages 663–664. American Cancer Society, 2016. URL: https://onlinelibrary.wiley.com/doi/abs/10.1002/9783527808465.EMC2016.6284, doi:10.1002/9783527808465.EMC2016.6284.

[Oph19]

Colin Ophus. Four-dimensional scanning transmission electron microscopy (4d-STEM): from scanning nanodiffraction to ptychography and beyond. Microscopy and Microanalysis, 25(3):563–582, may 2019. doi:10.1017/s1431927619000497.

[OS19]

Colin Ophus and Benjamin Savitzky. Simulated calibration dataset for 4D scanning transmission electron microscopy. December 2019. URL: https://doi.org/10.5281/zenodo.3592520, doi:10.5281/zenodo.3592520.

[POT+19]

Ouliana Panova, Colin Ophus, Christopher J. Takacs, Karen C. Bustillo, Luke Balhorn, Alberto Salleo, Nitash Balsara, and Andrew M. Minor. Diffraction imaging of nanocrystalline structures in organic semiconductor molecular thin films. Nature Materials, jun 2019. doi:10.1038/s41563-019-0387-3.

[Pen21]

T. Pennycook. Graphene simulated dataset. January 2021. URL: https://doi.org/10.5281/zenodo.4476506, doi:10.5281/zenodo.4476506.

[Ros76]

H. Rose. Nonstandard imaging methods in electron microscopy. Ultramicroscopy, 2:251–267, 1976. URL: https://www.sciencedirect.com/science/article/pii/S0304399176915382, doi:https://doi.org/10.1016/S0304-3991(76)91538-2.

[SG18]

Erich Schubert and Michael Gertz. Numerically stable parallel computation of (co-)variance. In Proceedings of the 30th International Conference on Scientific and Statistical Database Management - SSDBM 18. ACM Press, 2018. doi:10.1145/3221269.3223036.

[SRB+15]

M. Simson, H. Ryll, H. Banba, R. Hartmann, M. Huth, S. Ihle, L. Jones, Y. Kondo, K. Muller, P.D. Nellist, R. Sagawa, J. Schmidt, H. Soltau, L. Striider, and H. Yang. 4d-STEM imaging with the pnCCD (s)TEM-camera. Microscopy and Microanalysis, 21(S3):2211–2212, aug 2015. doi:10.1017/s1431927615011836.

[SCWMC21]

Achim Strauch, Alexander Clausen, Dieter Weber, and Knut Müller-Caspary. High-resolution 4D STEM dataset of SrTiO3 along the [1 0 0] axis at high magnification. July 2021. Achim Strauch and Knut Müller-Caspary were supported by funding from the Initiative and Network Fund of the Helmholtz Association (Germany) under contract VH-NG-1317 (moreSTEM project). Knut Müller-Caspary acknowledges support from Helmholtz within the project "Ptychography 4.0" under contract ZT-I-0025. URL: https://doi.org/10.5281/zenodo.5113449, doi:10.5281/zenodo.5113449.

[SWCMC21]

Achim Strauch, Dieter Weber, Alexander Clausen, and Knut Müller-Caspary. Synthetic 4D STEM dataset based on a SrTiO3 supercell with two additional artificial spatial frequencies. July 2021. Achim Strauch and Knut Müller-Caspary were supported by funding from the Initiative and Network Fund of the Helmholtz Association (Germany) under contract VH-NG-1317 (moreSTEM project). Knut Müller-Caspary acknowledges support from Helmholtz within the project "Ptychography 4.0" under contract ZT-I-0025. URL: https://doi.org/10.5281/zenodo.5113235, doi:10.5281/zenodo.5113235.

[TPC+16]

Mark W. Tate, Prafull Purohit, Darol Chamberlain, Kayla X. Nguyen, Robert Hovden, Celesta S. Chang, Pratiti Deb, Emrah Turgut, John T. Heron, Darrell G. Schlom, Daniel C. Ralph, Gregory D. Fuchs, Katherine S. Shanks, Hugh T. Philipp, David A. Muller, and Sol M. Gruner. High dynamic range pixel array detector for scanning transmission electron microscopy. Microscopy and Microanalysis, 22(01):237–249, jan 2016. doi:10.1017/s1431927615015664.

[ZMB+19]

Steven E Zeltmann, Alexander Müller, Karen C Bustillo, Benjamin Savitzky, Lauren Hughes, Andrew M Minor, and Colin Ophus. Patterned probes for high precision 4D-STEM bragg measurements. Ultramicroscopy, 2019. arXiv:1907.05504v2, doi:10.1016/j.ultramic.2019.112890.

[LinWongJiang+14]

S. C. F. Lin, C. Y. Wong, G. Jiang, M. A. Rahman, and N. M. Kwok. Radial fourier analysis (RFA) image descriptor. In 2014 11th International Conference on Fuzzy Systems and Knowledge Discovery (FSKD), 814–819. Aug 2014. doi:10.1109/FSKD.2014.6980942.