Pandas groupby.rank(): unexpected behavior with object type column

Code Sampledf = pd.DataFrame({'A':['a','b']})df['B'] = Nonedf.groupby('A').B.rank(method='first') # worksdf.B[0] = 1df.g...

Pandas Bug: SettingWithCopyWarning in some cases cites wrong or no lines when showing where an error occurred.

Changing the topic with some editing:I stumbled over SettingWithCopyWarning outputting wrong lines when referencing wher...

Pandas Unexpected Broadcast error when diving string by float

Code Sample, a copy-pastable example if possibleimport numpy as npimport pandas as pddf = pd.DataFrame(np.random.rand(10...

Pandas Feature request: vectorized sort on string split results

Problem descriptionI'm searching without success a way to sort the results of pd.Series.str.split.I was expecting a str ...

Pandas SUMPRODUCT Excel functionality

Hi,it's not a real issue, though I'm having some hard hours because of it. The simple SUMPRODUCT (=weighted average of c...

Pandas Add a .relabel method; deprecate .rename and .rename_axis for relabeling

This was suggested by @MaximilianR here, but @jorisvandenbossche (rightly) proposed to discuss separately.This is relate...

Pandas PERF: isinstance(x, ABCFoo) vs isinstance(x, Foo)

Using the test cases in tests.dtypes.test_generic, checking isinstance(item, ABCIndex) takes ~2x as long as isinstance(i...

Pandas API: Wrong DateOffset behaviour with DST changes

Given a timezone-aware Timestamp:foo = pd.Timestamp('2016-10-30 00:00:00', tz=pytz.timezone('Europe/Helsinki'))(Please n...

Pandas An option to run once for pandas.DataFrame.apply

In the current implementation apply calls func twice on the first column/row to decide whether it can take a fast or slo...

Pandas to_stata Append to Exisiting file.

with open("C:\Test\Old.dta", "a") as outfile: a = pd.read_stata("C:\Test\New.dta") a.to_stata(outfile,write_index=...

Pandas DOC: SyntaxError in pd.read_msgpack examples

In [319]: for o in pd.read_msgpack('foo.msg',iterator=True):.....: print o.....:File "<ipython-input-319-a0f40395...

Pandas Reindexing raises TypeError on Py2.7

Code Sample, a copy-pastable example if possibleimport pandas as pddf = pd.DataFrame({'x': [1., 5., 3.]}, index=[0, 2, 4...

Pandas perf for read_csv with parse_dates

Caching (memoizing) the date_parser function in read_csv might be an easy perf improvement. Seems it is not cached unles...

Pandas Panel.mean numeric_only=True raises NotImplementedError

p = pd.Panel(data=np.zeros((2, 3, 3)))p.mean(numeric_only=True)<snip></snip>NotImplementedError: Panel.mean ...

Pandas Invalid Type Comparison between numerical column and empty string

Seems like a bug, such a comparison should definitely be allowed. Pandas Version 0.20.2, Ubuntu. In [2]: import pandas a...

Pandas Passing a timezone-aware string to pd.to_datetime creates a timezone-naive Timestamp

Code Sample>>> pd.to_datetime('2017-01-01T15:00:00-05:00')... Timestamp('2017-01-01 20:00:00') # This is a tim...

Pandas Elements of array are converted from object to string type upon writing and reading back csv.

Hi there,I have a pandas array that contains lists of tuples as shown below: APP_chr21 GRI...

Pandas date_range segfaults on None utcoffset

As reported at pendulum, there are cases where creating a date_range fails with a segfault: pendulum issues 131.Code Sam...

Pandas Weird datetime string default parsing on to_datetime

When using default to_datetime, without format, the answers is sometimes a bit weird:>>> import pandas as pd&gt...

Pandas slicing of columns using column index is throwing error(but works for column names)

Code Sample, a copy-pastable example if possible# Your code heredf = pd.read_csv("https://github.com/jennybc/gapmin...
上一页 下一页
.