blob: 337f686c92561efa8591e231f639c1e8c66977d2 [file] [log] [blame]
.. Licensed to the Apache Software Foundation (ASF) under one
or more contributor license agreements. See the NOTICE file
distributed with this work for additional information
regarding copyright ownership. The ASF licenses this file
to you under the Apache License, Version 2.0 (the
"License"); you may not use this file except in compliance
with the License. You may obtain a copy of the License at
.. http://www.apache.org/licenses/LICENSE-2.0
.. Unless required by applicable law or agreed to in writing,
software distributed under the License is distributed on an
"AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
KIND, either express or implied. See the License for the
specific language governing permissions and limitations
under the License.
.. _api.series:
======
Series
======
.. currentmodule:: pyspark.pandas
Constructor
-----------
.. autosummary::
:toctree: api/
Series
Attributes
----------
.. autosummary::
:toctree: api/
Series.index
Series.dtype
Series.dtypes
Series.ndim
Series.name
Series.shape
Series.axes
Series.size
Series.empty
Series.T
Series.hasnans
Series.values
Conversion
----------
.. autosummary::
:toctree: api/
Series.astype
Series.copy
Series.bool
Indexing, iteration
-------------------
.. autosummary::
:toctree: api/
Series.at
Series.iat
Series.loc
Series.iloc
Series.keys
Series.pop
Series.items
Series.iteritems
Series.item
Series.xs
Series.get
Binary operator functions
-------------------------
.. autosummary::
:toctree: api/
Series.add
Series.div
Series.mul
Series.radd
Series.rdiv
Series.rmul
Series.rsub
Series.rtruediv
Series.sub
Series.truediv
Series.pow
Series.rpow
Series.mod
Series.rmod
Series.floordiv
Series.rfloordiv
Series.divmod
Series.rdivmod
Series.combine_first
Series.lt
Series.gt
Series.le
Series.ge
Series.ne
Series.eq
Series.product
Series.dot
Function application, GroupBy & Window
--------------------------------------
.. autosummary::
:toctree: api/
Series.apply
Series.agg
Series.aggregate
Series.transform
Series.map
Series.groupby
Series.rolling
Series.expanding
Series.pipe
.. _api.series.stats:
Computations / Descriptive Stats
--------------------------------
.. autosummary::
:toctree: api/
Series.abs
Series.all
Series.any
Series.between
Series.clip
Series.corr
Series.count
Series.cov
Series.cummax
Series.cummin
Series.cumsum
Series.cumprod
Series.describe
Series.filter
Series.kurt
Series.mad
Series.max
Series.mean
Series.min
Series.mode
Series.nlargest
Series.nsmallest
Series.pct_change
Series.prod
Series.nunique
Series.is_unique
Series.quantile
Series.rank
Series.sem
Series.skew
Series.std
Series.sum
Series.median
Series.var
Series.kurtosis
Series.unique
Series.value_counts
Series.round
Series.diff
Series.is_monotonic
Series.is_monotonic_increasing
Series.is_monotonic_decreasing
Reindexing / Selection / Label manipulation
-------------------------------------------
.. autosummary::
:toctree: api/
Series.align
Series.drop
Series.droplevel
Series.drop_duplicates
Series.equals
Series.add_prefix
Series.add_suffix
Series.first
Series.head
Series.idxmax
Series.idxmin
Series.isin
Series.last
Series.rename
Series.rename_axis
Series.reindex
Series.reindex_like
Series.reset_index
Series.sample
Series.swaplevel
Series.swapaxes
Series.take
Series.tail
Series.where
Series.mask
Series.truncate
Missing data handling
---------------------
.. autosummary::
:toctree: api/
Series.backfill
Series.bfill
Series.isna
Series.isnull
Series.notna
Series.notnull
Series.pad
Series.dropna
Series.fillna
Reshaping, sorting, transposing
-------------------------------
.. autosummary::
:toctree: api/
Series.argsort
Series.argmin
Series.argmax
Series.sort_index
Series.sort_values
Series.unstack
Series.explode
Series.repeat
Series.squeeze
Series.factorize
Combining / joining / merging
-----------------------------
.. autosummary::
:toctree: api/
Series.append
Series.compare
Series.replace
Series.update
Time series-related
-------------------
.. autosummary::
:toctree: api/
Series.asof
Series.shift
Series.first_valid_index
Series.last_valid_index
Series.at_time
Series.between_time
Spark-related
-------------
``Series.spark`` provides features that does not exist in pandas but
in Spark. These can be accessed by ``Series.spark.<function/property>``.
.. autosummary::
:toctree: api/
Series.spark.column
Series.spark.transform
Series.spark.apply
Accessors
---------
Pandas API on Spark provides dtype-specific methods under various accessors.
These are separate namespaces within :class:`Series` that only apply
to specific data types.
=========== ===========================
Data Type Accessor
=========== ===========================
Datetime :ref:`dt <api.series.dt>`
String :ref:`str <api.series.str>`
Categorical :ref:`cat <api.series.cat>`
=========== ===========================
.. _api.series.dt:
Date Time Handling
------------------
``Series.dt`` can be used to access the values of the series as
datetimelike and return several properties.
These can be accessed like ``Series.dt.<property>``.
Datetime Properties
~~~~~~~~~~~~~~~~~~~
.. autosummary::
:toctree: api/
Series.dt.date
Series.dt.year
Series.dt.month
Series.dt.day
Series.dt.hour
Series.dt.minute
Series.dt.second
Series.dt.microsecond
Series.dt.week
Series.dt.weekofyear
Series.dt.dayofweek
Series.dt.weekday
Series.dt.dayofyear
Series.dt.quarter
Series.dt.is_month_start
Series.dt.is_month_end
Series.dt.is_quarter_start
Series.dt.is_quarter_end
Series.dt.is_year_start
Series.dt.is_year_end
Series.dt.is_leap_year
Series.dt.daysinmonth
Series.dt.days_in_month
Datetime Methods
~~~~~~~~~~~~~~~~
.. autosummary::
:toctree: api/
Series.dt.normalize
Series.dt.strftime
Series.dt.round
Series.dt.floor
Series.dt.ceil
Series.dt.month_name
Series.dt.day_name
.. _api.series.str:
String Handling
---------------
``Series.str`` can be used to access the values of the series as
strings and apply several methods to it. These can be accessed
like ``Series.str.<function/property>``.
.. autosummary::
:toctree: api/
Series.str.capitalize
Series.str.cat
Series.str.center
Series.str.contains
Series.str.count
Series.str.decode
Series.str.encode
Series.str.endswith
Series.str.extract
Series.str.extractall
Series.str.find
Series.str.findall
Series.str.get
Series.str.get_dummies
Series.str.index
Series.str.isalnum
Series.str.isalpha
Series.str.isdigit
Series.str.isspace
Series.str.islower
Series.str.isupper
Series.str.istitle
Series.str.isnumeric
Series.str.isdecimal
Series.str.join
Series.str.len
Series.str.ljust
Series.str.lower
Series.str.lstrip
Series.str.match
Series.str.normalize
Series.str.pad
Series.str.partition
Series.str.repeat
Series.str.replace
Series.str.rfind
Series.str.rindex
Series.str.rjust
Series.str.rpartition
Series.str.rsplit
Series.str.rstrip
Series.str.slice
Series.str.slice_replace
Series.str.split
Series.str.startswith
Series.str.strip
Series.str.swapcase
Series.str.title
Series.str.translate
Series.str.upper
Series.str.wrap
Series.str.zfill
.. _api.series.cat:
Categorical accessor
--------------------
Categorical-dtype specific methods and attributes are available under
the ``Series.cat`` accessor.
.. autosummary::
:toctree: api/
Series.cat.categories
Series.cat.ordered
Series.cat.codes
Series.cat.rename_categories
Series.cat.reorder_categories
Series.cat.add_categories
Series.cat.remove_categories
Series.cat.remove_unused_categories
Series.cat.set_categories
Series.cat.as_ordered
Series.cat.as_unordered
.. _api.series.plot:
Plotting
-------------------------------
``Series.plot`` is both a callable method and a namespace attribute for
specific plotting methods of the form ``Series.plot.<kind>``.
.. autosummary::
:toctree: api/
Series.plot
Series.plot.area
Series.plot.bar
Series.plot.barh
Series.plot.box
Series.plot.density
Series.plot.hist
Series.plot.line
Series.plot.pie
Series.plot.kde
Series.hist
Serialization / IO / Conversion
-------------------------------
.. autosummary::
:toctree: api/
Series.to_pandas
Series.to_numpy
Series.to_list
Series.to_string
Series.to_dict
Series.to_clipboard
Series.to_latex
Series.to_markdown
Series.to_json
Series.to_csv
Series.to_excel
Series.to_frame
Pandas-on-Spark specific
------------------------
``Series.pandas_on_spark`` provides pandas-on-Spark specific features that exists only in pandas API on Spark.
These can be accessed by ``Series.pandas_on_spark.<function/property>``.
.. autosummary::
:toctree: api/
Series.pandas_on_spark.transform_batch