Structural Analysis and Pattern Discovery in URL Datasets


[Up] [Top]

Documentation for package ‘urlexplorer’ version 0.1.0

Help Pages

count_fragments Count fragments in URLs
count_hosts Count different hosts found in URLs
count_param_names Count different parameter names in query strings
count_param_values Count different values for a specified parameter across query strings
count_paths Count different paths found in URLs
count_path_segments Count occurrences of specific path segments at a given index
count_ports Count different port numbers used in URLs
count_queries Count the occurrence of query strings in URLs
count_schemes Count different schemes used in URLs
count_userinfos Count occurrences of userinfo in URLs
extract_file_extension Extract file extension from URLs or paths
extract_fragment Extract the fragment from URL
extract_host Extract the host from URL
extract_param_value Extract the value of a specified parameter from the query string
extract_path Extract the path from URL
extract_path_segment Extract a specific segment from a path
extract_port Extract the port number from URL
extract_query Extract the query from URL
extract_scheme Extract the scheme from URL
extract_userinfo Extract userinfo from URL
split_host Split host into subdomains and domain
split_path Split path into segments
split_query Split query into parameters
split_url Split URL into its constituent parts
websitepages Sample web site URLs