SQL vs Pyspark-1
SQL vs Pyspark-1
DML OPERATIONS
https://www.linkedin.com/in/mrabhijitsahoo/
Concept SQL PySpark
https://www.linkedin.com/in/mrabhijitsahoo/
Concept SQL PySpark
CURDATE,
from pyspark.sql.functions import current_date;
NOW, SELECT CURDATE() FROM table
df.select(current_date())
CURTIME
https://www.linkedin.com/in/mrabhijitsahoo/
Concept SQL PySpark
df.createOrReplaceTempView("cte1");
WITH cte1 AS (SELECT * FROM
df_cte1 = spark.sql("SELECT * FROM
table1),
cte1 WHERE condition");
CTE SELECT * FROM cte1 WHERE
condition df_cte1.show() or
df.filter(condition1).filter(condition2)
https://www.linkedin.com/in/mrabhijitsahoo/
DDL operations
https://www.linkedin.com/in/mrabhijitsahoo/
Concept SQL PySpark
https://www.linkedin.com/in/mrabhijitsahoo/
Concept SQL PySpark
Dropping a
ALTER TABLE table_name
column df = df.drop("column_name")
DROP COLUMN column_name;
https://www.linkedin.com/in/mrabhijitsahoo/
https://www.linkedin.com/in/mrabhijitsahoo
/