Skip to contents padding-top: 70px;

How Do I Install An Old Version of PhotoGEA?

All PhotoGEA releases are tagged on GitHub, and the tag names are formatted as vX.Y.Z, where X.Y.Z is the version number. Because of this, remotes::install_github can be used to install old versions from within R, supplying the appropriate tag name as the ref input argument. For example, version 1.0.0 can be installed as follows:

remotes::install_github('eloch216/PhotoGEA', ref = 'v1.0.0')

Note that this command requires the remotes package, which can be installed from within R by typing install.packages('remotes').

How Do I Install The Development Version of PhotoGEA?

The development version of PhotoGEA contains the latest changes, although it may be a “work in progress” and subject to suddent changes without any warning. It can always be found on the unreleased branch on GitHub, and the package can be installed from this branch as follows:

remotes::install_github('eloch216/PhotoGEA', ref = 'unreleased')

Note that this command requires the remotes package, which can be installed from within R by typing install.packages('remotes').

What Should I Do When check_response_curve_data Fails?

When beginning to work with a new data set, it is fairly common to encounter problems that cause check_response_curve_data to fail. The most common issues are:

  • The manual log button was accidentally pressed while measuring, causing a response curve to have an extra point.

  • One or more “User Constant” was not updated before starting a to measure a new curve, causing two curves to have the same identifying metadata.

  • A curve was started, but needed to be ended early, resulting in a curve with fewer points than expected.

These are exactly the kinds of issues that check_response_curve_data is designed to detect. It may feel irritating to encounter these errors, but it’s better to know about issues with extra log points or user constants early in your analysis, before they cause additional downstream issues.

In each of these cases, it is often simplest to fix the issues by manually editing the log files in Excel. This is an easy way to delete extra rows or alter the values of user constants. When altering log files, it is always a good idea to keep an “original” version in case any of the changes need to be reverted.

Sometimes check_response_curve_data fails for a different reason – for example, perhaps different setpoint sequences were intentionally used while measuring the curves, or perhaps you have already cleaned up your data in Excel. In these cases, please see the “What If My Response Curves Have Different Numbers of Points?” and “What If I Clean My Data In Excel?” sections of this article.

What If My Response Curves Have Different Numbers of Points?

The analysis guides use data sets where each response curve has the same number of points, making it easy to apply check_response_curve_data and organize_response_curve_data. However, data sets may sometimes have curves measured using different sequences of setpoints.

To deal with this situation, one strategy is to split the full set into groups that are expected to use the same sequences, then separately check and organize each group, and finally recombine all the groups back together. The code snippets below show two examples of how this could be accomplished.

Splitting response curves according to the number of points in each curve

The following code would replace the calls to check_response_curve_data and organize_response_curve_data in the user guides. This code was originally written for a set of A-Ci curves that used different numbers of recovery points. The user wished to keep the final recovery point to use for subsequent analysis.

The curves used the following sequences of CO2_r setpoint values:

  • 16 points: 400, 300, 200, 120, 70, 30, 10, 400, 400, 400, 600, 800, 1200, 1500, 1800, 400

  • 18 points: 400, 300, 200, 120, 70, 30, 10, 400, 400, 400, 400, 400, 600, 800, 1200, 1500, 1800, 400

  • 19 points: 400, 300, 200, 120, 70, 30, 10, 400, 400, 400, 400, 400, 400, 600, 800, 1200, 1500, 1800, 400

# Add a new column called `curve_npts` that stores the number of points in each
# response curve
licor_data <- do.call(rbind, by(licor_data, licor_data[, 'curve_identifier'], function(x) {
  x[, 'curve_npts'] <- nrow(x)
  x
}))

# Choose points to remove, depending on how many points are in the curve
pts_to_remove <- list(
  '16' = c(1, 8:9,  16),
  '18' = c(1, 8:11, 18),
  '19' = c(1, 8:12, 19)
)

# Check and process each group of curves depending on how many points are in the
# curve
licor_exdf_list_processed <- by(licor_data, licor_data[, 'curve_npts'], function(x) {
  # Get the number of points in these curves
  npts <- x[1, 'curve_npts']

  # Make sure info is specified for this group of curves
  if (!as.character(npts) %in% names(pts_to_remove)) {
    stop('Points to remove were not specified for npts = `', npts, '`')
  }

  # Make sure the data meets basic requirements
  check_response_curve_data(x, 'curve_identifier', npts, 'CO2_r_sp')

  # Remove points with duplicated `CO2_r_sp` values and order by `Ci`
  organize_response_curve_data(
    x,
    'curve_identifier',
    pts_to_remove[[as.character(npts)]],
    'Ci'
  )
})

# Use `rbind` to recombine all the data
licor_data <- do.call(rbind, licor_exdf_list_processed)

Splitting the response curves according to the date they were measured and the number of points in each curve

This code would replace the calls to check_response_curve_data and organize_response_curve_data in the user guides. This code was originally written for a set of A-Ci curves that used different sequences of CO2_r setpoints on different days.

The curves used the following sequences of CO2_r setpoint values:

  • 2023-03-21: 18 points: 400, 300, 200, 150, 100, 75, 50, 40, 30, 20, 10, 400, 400, 600, 800, 1000, 1200, 1500

  • 2023-03-23: 19 points: 400, 300, 200, 150, 100, 75, 50, 40, 30, 20, 10, 400, 400, 500, 600, 800, 1000, 1200, 1500

  • 2023-03-24: 19 points: 400, 300, 200, 150, 100, 75, 50, 40, 30, 20, 400, 400, 450, 500, 600, 800, 1000, 1200, 1500

# Add a new column called `curve_npts` that stores the number of points in each
# response curve
licor_data <- do.call(rbind, by(licor_data, licor_data[, 'curve_identifier'], function(x) {
  x[, 'curve_npts'] <- nrow(x)
  x
}))

# Add a new column called `date_ymd` that stores the date formatted as
# YYYY-MM-DD
licor_data[, 'date_ymd'] <- paste(
  substring(licor_data[, 'date'], 1, 4),
  substring(licor_data[, 'date'], 5, 6),
  substring(licor_data[, 'date'], 7, 8),
  sep = '-'
)

# Add a new column called `date_ymd_npts` that combines the date and the number
# of points
licor_data[, 'date_ymd_npts'] <-
  paste(licor_data[, 'date_ymd'], licor_data[, 'curve_npts'], sep = ' - ')

# Choose points to remove, depending on the date the curve was measured and the
# number of points it contains
pts_to_remove <- list(
  '2023-03-21 - 18' = c(12, 13),
  '2023-03-23 - 19' = c(12, 13),
  '2023-03-24 - 19' = c(11, 12)
)

# Check and process each group of curves depending on the date and the number of
# points
licor_exdf_list_processed <- by(licor_data, licor_data[, 'date_ymd_npts'], function(x) {
  # Get the date and number of points in these curves
  date_ymd_npts <- x[1, 'date_ymd_npts']
  npts <- x[1, 'curve_npts']

  # Make sure info is specified for this group of curves
  if (!date_ymd_npts %in% names(pts_to_remove)) {
    stop('Points to remove were not specified for date_ymd_npts = `', date_ymd_npts, '`')
  }

  # Make sure the data meets basic requirements
  check_response_curve_data(x, 'curve_identifier', npts, 'CO2_r_sp')

  # Remove points with duplicated `CO2_r_sp` values and order by `Ci`
  organize_response_curve_data(
    x,
    'curve_identifier',
    pts_to_remove[[date_ymd_npts]],
    'Ci'
  )
})

# Use `rbind` to recombine all the data
licor_data <- do.call(rbind, licor_exdf_list_processed)

What If I Clean My Data In Excel?

In the user guides, the organize_response_curve_data and remove_points functions are used to remove recovery points and other points from sets of response curves. However, it is also possible to remove points in Excel before reading log files into R. In this case, it is necessary to make a few small alterations to the code used in the user guides.

One consideration is that after cleaning the curves in Excel, it is likely that not all curves have the same number of points or follow the same sequence of setpoint values. Because of this, the checks in check_response_curve_data are likely to fail. In this case, we recommend setting expected_npts to 0 (the default value) and error_on_failure to FALSE when calling check_response_curve_data. This will provide potentially useful information about the number of points in each curve, but won’t throw an error that would cause a script to stop running.

Another consideration is that organize_response_curve_data is not needed to remove any points from the curves. Yet, its other features, such as reordering the points and calculating average values, are still useful. In this case, we recommend setting measurement_numbers_to_remove to c() and leaving the other arguments as-is.

Putting this all together would produce something like the following code, which would replace the regular calls to check_response_curve_data and organize_response_curve_data from the user guides:

# Print info about the number of points in each curve to make sure
# `curve_identifier` is able to properly split the set into individual curves
check_response_curve_data(licor_data, 'curve_identifier', error_on_failure = FALSE)

# Reorder by `Ci` and calculate average values of leaf temperature and Qin
licor_data <- organize_response_curve_data(
    licor_data,
    'curve_identifier',
    c(),
    'Ci',
    columns_to_average = c('TleafCnd', 'Qin')
)

What If I Have Log Files From a Licor LI-6400, PP Systems CIRAS, or Other Gas Exchange Measurement System?

At the moment, PhotoGEA can read plaintext and Excel files created by Licor LI-6800 gas exchange measurement systems, but does not have functions for reading log files from other instruments.

Fortunately, there are other R packages that do, such as GasanalyzeR. Using a function from another package is a little complicated, because generally they do not use the same data structures or variable names as PhotoGEA. So, after reading a file with another package, there is typically some extra “conversion” that needs to take place.

The following example shows how to create a wrapper for the read_6400_xls function from the gasanalyzer R package. It also provides a simple function that “detects” the file type, so a mix of Licor LI-6400 and LI-6800 log files could be used. The code in the example would replace the lines that use read_gasex_file to create licor_exdf_list in the user guides.

As of the time this article was written (April 4, 2025), the read_6400_xls function is not available in the CRAN version of the gasanalyzer package. Instead, the package must be installed directly from its GitLab repository to get the latest version. This can be done by calling the following command from R:

remotes::install_gitlab('plantphys/gasanalyzer')

This example only processes the columns that are absolutely essential for fitting A-Ci curves. Depending on your goals, you may need to modify this code to include other columns.

# Helper function for reading 6400 Excel files. This is a wrapper for the
# read_6400_xls function from the gasanalyzer package that converts the output
# into a format usable by PhotoGEA.
#
# Debug option: If DEBUG_PRINT is set to TRUE below, the raw column names will
# be printed immediately after the file is loaded. This may be helpful if there
# are other columns (such as user constants) that also need to be renamed.
read_6400_xls_wrapper <- function(fpath) {
    # Read the contents of the file
    rawdata <- gasanalyzer::read_6400_xls(fpath)

    # Optional debug printing
    DEBUG_PRINT <- FALSE
    if (DEBUG_PRINT) {
      print(colnames(rawdata))
    }

    # Rename a few columns so they meet PhotoGEA's expectations; the "new"
    # PhotoGEA column name is on the left, and the "original" gasanalyzer
    # column name is on the right.
    new_column_names <- list(
        A = 'GasEx.A',
        Ca = 'GasEx.Ca',
        Ci = 'GasEx.Ci',
        DeltaPcham = 'Meas.DeltaPcham',
        gsw = 'GasEx.gsw',
        oxygen = 'Const.Oxygen',
        Pa = 'Meas.Pa',
        Qin = 'LeafQ.Qin',
        TleafCnd = 'GasEx.TleafCnd'
    )

    for (i in seq_along(new_column_names)) {
        gasanalyzer_name <- new_column_names[[i]]
        photogea_name    <- names(new_column_names)[i]

        colnames(rawdata)[colnames(rawdata) == gasanalyzer_name] <- photogea_name
    }

    # Drop units and convert to a regular data frame
    rawdata_df <- as.data.frame(units::drop_units(rawdata))

    # Convert to an exdf object
    exdf_obj <- exdf(rawdata_df)

    # Supply units for a few columns so they meet PhotoGEA's expectations, and
    # return
    document_variables(
        exdf_obj,
        c('read_6400_xls_wrapper', 'A',          'micromol m^(-2) s^(-1)'),
        c('read_6400_xls_wrapper', 'Ca',         'micromol mol^(-1)'),
        c('read_6400_xls_wrapper', 'Ci',         'micromol mol^(-1)'),
        c('read_6400_xls_wrapper', 'DeltaPcham', 'kPa'),
        c('read_6400_xls_wrapper', 'gsw',        'mol m^(-2) s^(-1)'),
        c('read_6400_xls_wrapper', 'oxygen',     'percent'),
        c('read_6400_xls_wrapper', 'Pa',         'kPa'),
        c('read_6400_xls_wrapper', 'Qin',        'micromol m^(-2) s^(-1)'),
        c('read_6400_xls_wrapper', 'TleafCnd',   'degrees C')
    )
}

# Helper function that tries to read a file using PhotoGEA::read_gasex_file, but
# then uses `read_6400_xls_wrapper` if there is an error.
read_gasex_file_plus_6400 <- function(fpath) {
  cat(paste0(
    '\nAttempting to read `', fpath, '` using PhotoGEA::read_gasex_file\n'
  ))

  tryCatch(
    read_gasex_file(fpath),
    error = function(e) {
      cat(paste0(
        '\nAn error occurred. Attempting to read `', fpath,
        '` using gasanalyzer::read_6400_xls\n'
      ))

      read_6400_xls_wrapper(fpath)
    }
  )
}

# Load each file, storing the result in a list
licor_exdf_list <- lapply(file_paths, function(fpath) {
  read_gasex_file_plus_6400(fpath)
})

What If My Files Don’t Have User Constants or Other Metadata?

Please see the Guide to Licor LI-6800 User Constants.