Warning: file_get_contents(/data/phpspider/zhask/data//catemap/8/sorting/2.json): failed to open stream: No such file or directory in /data/phpspider/zhask/libs/function.php on line 167

Warning: Invalid argument supplied for foreach() in /data/phpspider/zhask/libs/tag.function.php on line 1116

Notice: Undefined index: in /data/phpspider/zhask/libs/function.php on line 180

Warning: array_chunk() expects parameter 1 to be array, null given in /data/phpspider/zhask/libs/function.php on line 181
Sorting 在pySpark中,orderby和sort之间有什么区别_Sorting_Sql Order By_Pyspark - Fatal编程技术网

Sorting 在pySpark中,orderby和sort之间有什么区别

Sorting 在pySpark中,orderby和sort之间有什么区别,sorting,sql-order-by,pyspark,Sorting,Sql Order By,Pyspark,这是一个基本问题,但是 两者都用于对数据进行排序,并且都具有相同的功能,如默认情况下的performe asc sort Order by 我认为它们是同义词: 看 def sort(self, *cols, **kwargs): """Returns a new :class:`DataFrame` sorted by the specified column(s). :param cols: list of :class:`Column` or column names t

这是一个基本问题,但是

两者都用于对数据进行排序,并且都具有相同的功能,如默认情况下的performe asc

sort
Order by

我认为它们是同义词:

def sort(self, *cols, **kwargs):
    """Returns a new :class:`DataFrame` sorted by the specified column(s).
    :param cols: list of :class:`Column` or column names to sort by.
    :param ascending: boolean or list of boolean (default True).
        Sort ascending vs. descending. Specify list for multiple sort orders.
        If a list is specified, length of the list must equal length of the `cols`.
    >>> df.sort(df.age.desc()).collect()
    [Row(age=5, name=u'Bob'), Row(age=2, name=u'Alice')]
    >>> df.sort("age", ascending=False).collect()
    [Row(age=5, name=u'Bob'), Row(age=2, name=u'Alice')]
    >>> df.orderBy(df.age.desc()).collect()
    [Row(age=5, name=u'Bob'), Row(age=2, name=u'Alice')]
    >>> from pyspark.sql.functions import *
    >>> df.sort(asc("age")).collect()
    [Row(age=2, name=u'Alice'), Row(age=5, name=u'Bob')]
    >>> df.orderBy(desc("age"), "name").collect()
    [Row(age=5, name=u'Bob'), Row(age=2, name=u'Alice')]
    >>> df.orderBy(["age", "name"], ascending=[0, 1]).collect()
    [Row(age=5, name=u'Bob'), Row(age=2, name=u'Alice')]
    """
    jdf = self._jdf.sort(self._sort_cols(cols, kwargs))
    return DataFrame(jdf, self.sql_ctx)

orderBy = sort