Spark SQL#
- Apache Arrow in PySpark
 - Python User-defined Table Functions (UDTFs)
 - Python Data Source API
- Overview
 - Simple Example
 - Creating a Python Data Source
 - Implementing Batch Reader and Writer for Python Data Source
 - Implementing Streaming Reader and Writer for Python Data Source
 - Serialization Requirement
 - Using a Python Data Source
 - Python Data Source Reader with direct Arrow Batch support for improved performance
 
 - Python to Spark Type Conversions