5.2.1 - Spark & PySpark
Last updated Feb 23, 2025
```notebook-python
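from pyspark import SparkFiles
from pyspark.sql import SparkSession

# Reuse the notebook's active session, or create one if none exists
spark = SparkSession.builder.getOrCreate()

file_url = 'https://d37ci6vzurychx.cloudfront.net/trip-data/yellow_tripdata_2024-10.parquet'
# Download the file and make it available on every node
spark.sparkContext.addFile(file_url)
# Read into Spark DF (the file is Parquet, so use the Parquet reader, not CSV)
df = spark.read.parquet(SparkFiles.get('yellow_tripdata_2024-10.parquet'))
df.count()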
```