<?xml version="1.0"?><eml:eml xmlns:eml="https://eml.ecoinformatics.org/eml-2.2.0" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:stmml="http://www.xml-cml.org/schema/stmml-1.1" packageId="ess-dive-b659b63c1c1b3c3-20220407T154139456" system="ess-dive" xsi:schemaLocation="https://eml.ecoinformatics.org/eml-2.2.0 https://eml.ecoinformatics.org/eml-2.2.0/eml.xsd">    <dataset id="dataset.1"><alternateIdentifier>paf_774_780</alternateIdentifier><title>QA/QC of the East River, Colorado, discharge and geochemical time series datasets (Almont, BCC, and Pump House) to be used for modeling of hydrogeochemical balance</title><creator id="5777663057405587">            <individualName><givenName>Boris</givenName><surName>Faybishenko</surName></individualName><organizationName>Lawrence Berkeley National Laboratory</organizationName><electronicMailAddress>bafaybishenko@lbl.gov</electronicMailAddress>                                    <userId directory="https://orcid.org">https://orcid.org/0000-0003-0085-8499</userId>        </creator><creator id="2017097988653420">            <individualName><givenName>Patricia</givenName><surName>Fox</surName></individualName><organizationName>Lawrence Berkeley National Laboratory</organizationName><electronicMailAddress>pmfox@lbl.gov</electronicMailAddress>                                    <userId directory="https://orcid.org">https://orcid.org/0000-0002-5264-1876</userId>        </creator><associatedParty id="1854042600745517"><organizationName>U.S. DOE &#x3E; Office of Science &#x3E; Biological and Environmental Research (BER)</organizationName>                        <userId directory="unknown">http://dx.doi.org/10.13039/100006206</userId>            <role>fundingOrganization</role>        </associatedParty><pubDate>2022</pubDate>                                                        <abstract><para>The following datasets were QA/QC-ed (Quality Assurance/Quality Control): 1. Brush Creek Confluence (BCC) discharge data (from Helen Malenda, USGS, Colorado School of Mines), which were calculated using the pressure transducer data and rating curves. The original 15 min time series data were presented as mean daily discharge. 2. Almont discharge data from United States Geological Survey (USGS). The original data were in 15 min time intervals, and were averaged to mean daily discharge time series. 3. Pump House discharge data as mean daily discharge (downloaded from the SFA portal). 4. BCC and Pump House chemistry data from SFA data portal and/or original spreadsheets provided by Roelof Versteeg. The following challenging QA/QC problems of the datasets were resolved: Missing data with the duration of gaps up to &#x3E;1 month; Duplicated dates; Anomalies and outliers of discharge and concentrations; Time stamps of measurements of the discharge and concentrations are not aligned (hydrogeochemical balance calculations require the timestamps to be aligned). All QA/QC-ed datasets are given as csv files. The csv files were prepared using the xts files with multiple worksheets, which are also included in the data packages. Figures of the QA/QC-ed datasets are given in the jpeg and pdf formats. The QA/QC-ed datasets have been used to quantify discharge and chemical concentrations in river water in order to understand riverine exports of water and dissolved constituents in the East River watershed. These datasets served as a basis in the presentation given by P. Fox et al. at the 2021 Goldschmidt Conference.</para></abstract><keywordSet><keyword>River discharge</keyword><keyword> geochemical data</keyword><keyword>Quality Assurance/Quality Control</keyword><keyword>QA/QC</keyword><keywordThesaurus>CATEGORICAL:NONE</keywordThesaurus></keywordSet><keywordSet><keyword>Anions</keyword><keyword>Cations</keyword><keyword>River stage/discharge</keyword><keywordThesaurus>VARIABLE:NONE</keywordThesaurus></keywordSet>                        <additionalInfo><section><title>Related References</title><para>Additional metadata on specific locations within the watershed are provided in the following related data package:</para><para>Varadharajan C ; Kakalia Z ; Banfield J ; Berkelhammer M ; Brodie E ; Christianson D ; Dafflon B ; Carbone M S ; Carroll R ; Chadwick K D ; Christensen J ; Enquist B J ; Fox P ; Henderson M ; Gochis D ; Kueppers L ; Powell T ; Matheus Carnevali P ; Singha K ; Sorensen P ; Tokunaga T ; Versteeg R ; Wilkins M ; Williams K ; Worsham M ; Wu Y ; Agarwal D (2020): Location Identifiers, Metadata, and Map for Field Measurements at the East River Watershed, Colorado, USA. Watershed Function SFA. doi:10.15485/1660962</para></section></additionalInfo>        <intellectualRights><para>This work is licensed under the Creative Commons Attribution 4.0 International License. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.</para></intellectualRights>        <coverage>            <temporalCoverage><rangeOfDates><beginDate><calendarDate>2015-01-01</calendarDate></beginDate><endDate><calendarDate>2020-12-31</calendarDate></endDate></rangeOfDates></temporalCoverage>            <geographicCoverage>                <geographicDescription>The East River (ER) is a snow‐dominated, headwater basin of the Upper Colorado River Basin (UCRB) located in the western United States. The ER is the designated testbed of Lawrence Berkeley National Laboratory's Watershed Function Scientific Focus Area (WFSFA). Through WFSFA, observational networks have been established to measure stream discharge and precipitation chemistry. The ER is considered representative of many snow‐dominated headwaters of the Rocky Mountains. The study domain encompasses nearly 85 square km, a 1.4‐km vertical drop in elevation (4,120 to 2,760 m) and pristine alpine, subalpine, montane, and riparian ecosystems. The ER contains high‐energy mountain streams to low‐energy meandering floodplains and is eroding primarily into the Cretaceous, carbon‐rich, marine shale of the Mancos Formation. Additional metadata on specific locations within the watershed are provided in the following related data package: Varadharajan C. et al. (2020) doi:10.15485/1660962</geographicDescription>                <boundingCoordinates><westBoundingCoordinate>-107.05</westBoundingCoordinate><eastBoundingCoordinate>-106.88</eastBoundingCoordinate><northBoundingCoordinate>39.034</northBoundingCoordinate><southBoundingCoordinate>38.88</southBoundingCoordinate></boundingCoordinates>            </geographicCoverage>        </coverage><contact id="7790429227799486">            <individualName><givenName>Boris</givenName><surName>Faybishenko</surName></individualName><organizationName>Lawrence Berkeley National Laboratory</organizationName><electronicMailAddress>bafaybishenko@lbl.gov</electronicMailAddress>                                    <userId directory="https://orcid.org">https://orcid.org/0000-0003-0085-8499</userId>        </contact><publisher id="7682504253105899"><organizationName>Watershed Function SFA</organizationName></publisher><methods>            <methodStep>                <description><para>QA/QC codes developed using the R libraries  lubridate, zoo, plyr, vctrs, rstatix, DescTools, xts, highfrequency, imputeTS, tsoutliers, openxlsx, data.table, anytime, dplyr, insol, dynlm. </para><para>The QA/QC procedure included: </para><para>SET A WORKING DIRECTORY and DOWNLOAD A FILE</para><para>DEVELOP A VECTOR OF DATES/TIMES IN THE as.POSIXct format</para><para>CHECK THE SUMMARY OF DATES</para><para>CHECK THE NUMBER OF NAs of DATES</para><para>Stop if class of Date is not "POSIXct"  </para><para>CREATE A VECTOR OF FLAGGING DATES WITH NAs</para><para>DETECT AND FLAG DUPLICATED DATES  ----</para><para>Summary of duplicated dates ----</para><para>Length of the duplicated dates </para><para>CREATE A VECTOR OF FLAGS FOR DUPLICATED TIME STAMPS (FLAG ALL DUPLICATES) ----</para><para>Flagging of variables:</para><para>Create a vector of the variables and name them</para><para>Suppress warnings</para><para>Summary of the variables</para><para>Plot Discharge vs temperature ----</para><para>Length of each variable ----</para><para>Create zoo files</para><para>Summary of the variables ----</para><para>Print summary of the variables –</para><para>Create a list of data frames</para><para>Total number of rows with Nas</para><para>Drop  rows with Nas</para><para>Print summary of data frames with no NAs.</para><para>The following R packages are used:</para><para>lubridate, zoo, plyr, vctrs, rstatix, DescTools, xts, highfrequency, imputeTS, tsoutliers, openxlsx, data.table, anytime, dplyr, insol, dynlm</para></description>            </methodStep>        </methods><project><title>Watershed Function SFA</title><personnel id="9149625613744880"><organizationName>Watershed Function SFA [PI: Eoin Brodie]</organizationName><role>metadataProvider</role></personnel><funding><para>DOE:DEAC0205CH11231 (Lawrence  Berkeley National Laboratory)</para></funding></project><otherEntity id="ess-dive-bd3673535d0ac85-20220331T150114710741">            <entityName>Brush_csv.zip</entityName>            <entityType>application/zip</entityType>        </otherEntity><otherEntity id="ess-dive-bcb7dbb8c107be2-20220331T150117436851">            <entityName>Brush_Figures.zip</entityName>            <entityType>application/zip</entityType>        </otherEntity>                                            </dataset></eml:eml>