|
PySpark Plaso
Release 2019
A tool for distributed extraction of timestamps from various files using extractors adapted from the Plaso engine to Apache Spark.
|


Public Member Functions | |
| def | __init__ (self, hdfs_base_uri) |
| def | ls (self, hdfs_path="") |
| def | rm (self, hdfs_path="") |
| def | get (self, hdfs_path="") |
| def | get_zip (self, hdfs_path="") |
| def | put (self, request, hdfs_path) |
| def | put_form (self, request, hdfs_path) |
| def | zip_put (self, request, hdfs_path="") |
| def | zip_put_form (self, request, hdfs_path="") |
Public Member Functions inherited from plaso.tarzan.app.controllers.controller.Controller | |
| def | make_hdfs_uri (self, hdfs_path) |
| def | strip_hdfs_uri (self, hdfs_path) |
Public Attributes | |
| hdfs | |
Public Attributes inherited from plaso.tarzan.app.controllers.controller.Controller | |
| hdfs_base_uri | |
Controller for HDFS-related management operations.
| def plaso.tarzan.app.controllers.filemancontroller.FileManController.__init__ | ( | self, | |
| hdfs_base_uri | |||
| ) |
Create a new controller that is able to store and utilize HDFS URI in HDFS queries and operations. :param hdfs_base_uri: the base HDFS URI to store
Reimplemented from plaso.tarzan.app.controllers.controller.Controller.
| def plaso.tarzan.app.controllers.filemancontroller.FileManController.get | ( | self, | |
hdfs_path = "" |
|||
| ) |
Get a Flask Response to download a file in a given HDFS path. :param hdfs_path: the HDFS path of the file to download :return: the Flask Response of the download
| def plaso.tarzan.app.controllers.filemancontroller.FileManController.get_zip | ( | self, | |
hdfs_path = "" |
|||
| ) |
Get a Flask Response to download a ZIP archive of files and directories in a given HDFS path. :param hdfs_path: the HDFS path which include into the ZIP archive :return: the Flask Response to download the ZIP file
| def plaso.tarzan.app.controllers.filemancontroller.FileManController.ls | ( | self, | |
hdfs_path = "" |
|||
| ) |
Get a Flask Response listing all files and directories (the dirs are suffixed by "/") that are in a given HDFS path. :param hdfs_path: the HDFS path to search for the files and directories :return: the Flask Response of the list of files and directories in the path
| def plaso.tarzan.app.controllers.filemancontroller.FileManController.put | ( | self, | |
| request, | |||
| hdfs_path | |||
| ) |
Copy the content uploaded by a PUT request in the REST API to a file in a given HDFS path. :param request: the REST API PUT request :param hdfs_path: the HDFS path where to upload :return: the Flask Response confirming the upload
| def plaso.tarzan.app.controllers.filemancontroller.FileManController.put_form | ( | self, | |
| request, | |||
| hdfs_path | |||
| ) |
Copy the content uploaded by a HTML FORM to a file in a given HDFS path. :param request: the request which contains the FORM upload :param hdfs_path: the HDFS path where to upload :return: the Flask Response confirming the upload
| def plaso.tarzan.app.controllers.filemancontroller.FileManController.rm | ( | self, | |
hdfs_path = "" |
|||
| ) |
Get a Flask Response confirming a recursive removal of files and directories in a given HDFS path. :param hdfs_path: the HDFS path which should be removed including to remove its files and directories :return: the Flask Response of the confirmation of the remove
| def plaso.tarzan.app.controllers.filemancontroller.FileManController.zip_put | ( | self, | |
| request, | |||
hdfs_path = "" |
|||
| ) |
Extract the content of a ZIP archive uploaded by a PUT request in the REST API into a given HDFS path. :param request: the REST API PUT request :param hdfs_path: the HDFS path where to extract the ZIP archive :return: the Flask Response confirming the upload and extraction
| def plaso.tarzan.app.controllers.filemancontroller.FileManController.zip_put_form | ( | self, | |
| request, | |||
hdfs_path = "" |
|||
| ) |
Extract the content of a ZIP archive uploaded by a HTML FORM into a given HDFS path. :param request: the request which contains the FORM upload :param hdfs_path: the HDFS path where to extract the ZIP archive :return: the Flask Response confirming the upload and extraction
| plaso.tarzan.app.controllers.filemancontroller.FileManController.hdfs |
1.8.15