Package 'eatocsv'

Title: Download and extract Entity-Attribute metadata into a CSV
Description: Downloads and extracts Entity-Attribute metadata from EML documents stored in a DataONE Member Node
Authors: Bryce Mecum <[email protected]>
Maintainer: Bryce Mecum <[email protected]>
License: Apache License 2.0
Version: 0.1.0
Built: 2025-03-30 05:56:55 UTC
Source: https://github.com/amoeba/eatocsv

Help Index


Download objects from an MN, optionally in parallel

Description

This function uses the future package, so downloads can run either sequentially or in parallel.

Usage

download_objects(node, pids, path = getwd(), ext = ".xml", overwrite = FALSE)

Arguments

node

(CNode|MNode) The Node to download from

pids

(character) One or more PIDs to download from the node

path

(character) The destination directory for downloaded Objects

ext

(character) Optional. Specify a custom file extension to add to each downloaded file

overwrite

(logical) Whether to overwrite existing files (TRUE) or not (FALSE)

Value

(list) A list of file paths for the downloaded files
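
Examples

The following is a minimal sketch of a parallel download, not code from the package itself. It assumes the dataone and future packages are installed, that download_objects is exported from eatocsv, and that the records live on the KNB Member Node ("urn:node:KNB" is only an illustrative choice). The helper name download_example and its arguments are hypothetical. The network calls are wrapped in a function so that nothing runs until it is called.

```r
# Hypothetical helper: needs network access plus the dataone, future and
# eatocsv packages, so the work only happens when the function is called.
download_example <- function(pids, dest = tempdir()) {
  # Assumption: production Coordinating Node and the KNB Member Node;
  # substitute the node that actually holds your records.
  cn <- dataone::CNode("PROD")
  mn <- dataone::getMNode(cn, "urn:node:KNB")

  # Choose a parallel future backend and restore the previous plan on exit.
  old_plan <- future::plan(future::multisession, workers = 2)
  on.exit(future::plan(old_plan), add = TRUE)

  # Arguments follow the Usage signature documented above.
  eatocsv::download_objects(mn, pids, path = dest, ext = ".xml",
                            overwrite = FALSE)
}
```

Replacing future::multisession with future::sequential in the sketch would run the same downloads one at a time.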


Extract Entity-Attribute information from a set of EML records

Description

Extract Entity-Attribute information from a set of EML records

Usage

ea_to_csv(paths, datetime = Sys.time())

Arguments

paths

(character) Paths to one or more EML records to read from

datetime

(POSIXct) Optional. The query time to save with the resulting CSV

Value

(data.frame) A data.frame of Entity-Attribute information


Extract Entity-Attribute information from a set of EML records

Description

Extract Entity-Attribute information from a set of EML records

Usage

extract_ea(paths, datetime = Sys.time())

Arguments

paths

(character) Paths to one or more EML records to read from

datetime

(POSIXct) Optional. The query time to save with the resulting CSV

Value

(data.frame) A data.frame of Entity-Attribute information
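
Examples

As a rough sketch of how the extraction step might be used, the helper below collects the EML files in a directory, passes them to extract_ea, and writes the result with base R's write.csv. The helper name ea_example, its arguments, and the output file name are hypothetical, and the sketch assumes extract_ea is exported from eatocsv. The columns of the returned data.frame are not documented here, so none are assumed.

```r
# Hypothetical helper: extracts Entity-Attribute metadata from every .xml
# file in `eml_dir` and saves it as a CSV. Nothing runs until it is called.
ea_example <- function(eml_dir,
                       out_csv = file.path(eml_dir, "entity_attributes.csv")) {
  # Collect full paths to the EML records on disk.
  paths <- list.files(eml_dir, pattern = "\\.xml$", full.names = TRUE)

  # Arguments follow the Usage signature documented above.
  ea <- eatocsv::extract_ea(paths, datetime = Sys.time())

  # Write the data.frame without row names and return it invisibly.
  utils::write.csv(ea, out_csv, row.names = FALSE)
  invisible(ea)
}
```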


Extract keywords from EML records

Description

Extract keywords from EML records

Usage

extract_keywords(paths, datetime = Sys.time())

Arguments

paths

(character) Paths to one or more EML records to read from

datetime

(POSIXct) Optional. The query time to save with the resulting CSV

Value

(data.frame) A data.frame of keywords, one row per keyword


Get Entity-Attribute Metadata for one or more Objects

Description

Get Entity-Attribute Metadata for one or more Objects

Usage

get_object_ea(node, identifiers)

Arguments

node

(MNode|CNode) The Node where the Object(s) can be found

identifiers

(character) The identifier(s) (PIDs) of the Object(s)

Value

(data.frame) A table of Entity-Attribute metadata


Sanitize filenames into valid filesystem paths

Description

This is a custom routine, and it is probably more aggressive than strictly necessary.

Usage

sanitize_filename(filenames)

Arguments

filenames

(character) One or more filenames to sanitize

Value

(character) The sanitized filenames
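
Examples

PIDs such as DOIs often contain characters like ":" and "/" that are unsafe in file names, which is the kind of input a sanitizer has to handle. The exact rules used by sanitize_filename are not documented here, so the function below is a hypothetical stand-in (sanitize_sketch) that illustrates one deliberately aggressive approach using only base R: every character outside letters, digits, ".", "_" and "-" becomes an underscore.

```r
# Hypothetical stand-in, NOT the package's implementation: replace every
# character that is not a letter, digit, dot, underscore or hyphen.
sanitize_sketch <- function(filenames) {
  gsub("[^A-Za-z0-9._-]", "_", filenames)
}

sanitize_sketch(c("doi:10.5063/F1ABC", "a b.xml"))
# returns c("doi_10.5063_F1ABC", "a_b.xml")
```

A rule this strict can map distinct inputs to the same name (for example "a:b" and "a/b" both become "a_b"), so a caller may want to check for collisions before writing files.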