Skip to content

Speed up SAS7BDAT parser #7

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Draft
wants to merge 252 commits into
base: main
Choose a base branch
from
Draft
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
252 commits
Select commit Hold shift + click to select a range
25749d2
ENH: DTA.to_period support non-nano (#47324)
jbrockmendel Jun 13, 2022
f042800
Add test for multi-column dtype assignment (#47323)
jackgoldsmith4 Jun 13, 2022
28d5b01
ENH: Timestamp.normalize support non-nano (#47316)
jbrockmendel Jun 13, 2022
8de88ff
BUG: DateOffset addition preserve non-nano (#47334)
jbrockmendel Jun 13, 2022
b74dc5c
ENH: Timestamp +- timedeltalike scalar support non-nano (#47313)
jbrockmendel Jun 13, 2022
87da500
ENH: Timestamp.replace support non-nano (#47312)
jbrockmendel Jun 13, 2022
f40203c
ENH: Move UndefinedVariableError to error/__init__.py per GH27656 (#4…
dataxerik Jun 13, 2022
a8d8ae7
ENH: get_resolution support non-nano (#47322)
jbrockmendel Jun 13, 2022
38a7d29
ENH: Timestamp.tz_convert support non-nano (#47320)
jbrockmendel Jun 13, 2022
032c590
DOC: added example of valid input dict in dfgroupby.aggregate (#47317)
jasonjiajs Jun 13, 2022
830130a
TYP: Series.quantile (#47304)
twoertwein Jun 13, 2022
15902bd
[EHN] pandas.DataFrame.to_orc (#44554)
chloeandmargaret Jun 14, 2022
696e9bd
TYP: plotting._matplotlib (#47311)
twoertwein Jun 14, 2022
ad84367
BUG: concat not sorting mixed column names when None is included (#47…
phofl Jun 14, 2022
d0e82ed
Add run-tests action (#47292)
jonashaag Jun 14, 2022
007bf4a
ENH: Timestamp pickle support non-nano tzaware (#47340)
jbrockmendel Jun 14, 2022
f600fd4
ENH: preserve non-nano DTA/TDA in Index/Series/DataFrame (#47230)
jbrockmendel Jun 15, 2022
f7be58a
ENH: Timestamp.__sub__(datetimelike) support non-nano (#47346)
jbrockmendel Jun 15, 2022
7310d90
REGR: Fix nan comparison for same Index object (#47326)
phofl Jun 15, 2022
7c6a76a
REGR: Avoid regression warning with ea dtype and assert_index_equal o…
phofl Jun 15, 2022
6f0be79
REGR: MultiIndex.dtypes has regular Index instead of MultiIndex index…
phofl Jun 15, 2022
5f62767
ENH: DTA to_pydatetime, time, timetz, date, iter support non-nano (#4…
jbrockmendel Jun 15, 2022
4dfe48f
ENH: Move IndexingError to error/__init__.py per GH27656 (#47357)
dataxerik Jun 15, 2022
ff36ff4
improve period constructor docs (#47321)
jackgoldsmith4 Jun 15, 2022
a8153a8
TYP: base_parser and readers (#47359)
twoertwein Jun 15, 2022
7397adc
Allow reading SAS files from archives (#47154)
jonashaag Jun 15, 2022
9c241fe
BUG Avoid IndexError on writing empty (row) MI df to excel (GH19543) …
wf-r Jun 15, 2022
a803b7c
TST: adding a test for bar plot with intervalrange xaxis (#47344)
noatamir Jun 15, 2022
4489725
CI: Temporarily build pandas with N_JOBS=1 to avoid flakiness (#47341)
mroeschke Jun 15, 2022
92d2233
ENH: Timedelta division support non-nano (#47373)
jbrockmendel Jun 15, 2022
18124f9
ENH: Timestamp.tz_localize support non-nano (#47355)
jbrockmendel Jun 15, 2022
c6e7d8d
ENH: Timedelta/Timestamp round support non-nano (#47356)
jbrockmendel Jun 16, 2022
38b0a40
ENH: DTA/DTI __repr__ support non-nano (#47374)
jbrockmendel Jun 16, 2022
111bcbb
Do not install C sources in binary distributions (#46739)
musicinmybrain Jun 16, 2022
89578fe
REGR: Regression in to_csv for ea dtype categorical (#47347)
phofl Jun 16, 2022
2517b92
ENH/TST: Add TestBaseDtype tests for ArrowExtensionArray (#47358)
mroeschke Jun 17, 2022
22cb379
REF: Avoid ravel in DTA._format_native_types (#47411)
jbrockmendel Jun 18, 2022
47494a4
CLN: Remove .github markdown files (#47412)
mroeschke Jun 18, 2022
3f72d8c
Revert "REF: remove JoinUnit.shape (#43651)" (#47406)
jorisvandenbossche Jun 21, 2022
01b8cd2
CI: Pin PYTEST_WORKERS=1 for Windows builds due to memory errors (#47…
mroeschke Jun 21, 2022
f7c8fc8
TST: GH26650, added new test to validate numpy matmul function with d…
carbonleakage Jun 21, 2022
f6eacb4
CI: remove comment-bot (#47416)
MarcoGorelli Jun 21, 2022
3364f9a
REGR: Fix fillna making a copy when dict was given as fill value and …
phofl Jun 21, 2022
fd9b2a4
BUG: read_csv may interpret second row as index names even if index_c…
phofl Jun 21, 2022
afaf268
Consistent handling of 0-dim in Timedelta arithmetic methods (#47390)
jbrockmendel Jun 21, 2022
820a065
implement abbrev_to_npy_unit (#47395)
jbrockmendel Jun 21, 2022
37d4c31
REF: do masking in checked_add_with_arr (#47396)
jbrockmendel Jun 21, 2022
8dc832f
annotation, missing test case, perf DTA.mode (#47418)
jbrockmendel Jun 21, 2022
9821a2e
REF: simplify Timestamp.replace (#47435)
jbrockmendel Jun 21, 2022
c5a640d
Add tests (#47420)
jackgoldsmith4 Jun 21, 2022
b022a3b
BUG: read_excel raising uncontrolled IndexError when header reference…
phofl Jun 21, 2022
e63885f
DOC: clarify to_csv float format docstring (#47436)
phofl Jun 21, 2022
c7c2bcf
DOC: Remove unused/flaky statsmodels intersphinx_mapping (#47452)
mroeschke Jun 21, 2022
10967ce
BUG: iterparse of read_xml not parsing duplicate element and attribut…
ParfaitG Jun 21, 2022
dcf041b
BUG: df.getitem returning copy instead of view for unique column in d…
phofl Jun 22, 2022
fa5a604
DEPS: Sync environment.yml with CI dep files (#47287)
mroeschke Jun 22, 2022
d40c371
ENH/TST: Add BaseInterfaceTests tests for ArrowExtensionArray (#47377)
mroeschke Jun 22, 2022
faacb72
Fix segmentation fault when JSON serializing a PeriodIndex (#47431)
roberthdevries Jun 22, 2022
2f3ac16
CI/TST: Don't require length for construct_1d_arraylike_from_scalar c…
mroeschke Jun 22, 2022
fc11311
Revert "ENH/TST: Add BaseInterfaceTests tests for ArrowExtensionArray…
simonjayhawkins Jun 22, 2022
d43d6e2
REGR: revert behaviour change for concat with empty/all-NaN data (#47…
jorisvandenbossche Jun 22, 2022
cc41458
DOC: v1.4.3 release date (#47476)
simonjayhawkins Jun 23, 2022
8c9b84d
TYP: Fix typing errors on main (#47469)
mroeschke Jun 23, 2022
45f094d
DOC: Start v1.4.4 release notes (#47473)
simonjayhawkins Jun 23, 2022
878504d
CI: Run all code checks even if one fails (#47482)
mroeschke Jun 23, 2022
d3bb89d
Fix signature of initObjToJSON (#47466)
hoodmane Jun 23, 2022
771f2b1
Deprecate non-keyword arguments for rsplit (#47446)
MrShevan Jun 23, 2022
2c947e0
DOC: Add an interactive shell powered by JupyterLite to the website (…
jtpio Jun 23, 2022
1b79e2d
ENH: consistent add/sub behavior for mixed resolutions (#47394)
jbrockmendel Jun 23, 2022
8c0dbaa
DOC: move enhancements in 1.5 release notes (#47375)
simonjayhawkins Jun 23, 2022
bc40a4a
TST: GH 27185 Test to check df with timedelta & Int NA sums correctly…
aamnv Jun 23, 2022
3c100a5
BUG: assert_index_equal ignoring names when check_order is false (#47…
phofl Jun 23, 2022
76c242d
TYP: NumericDtype._standardize_dtype (#47298)
twoertwein Jun 23, 2022
5d880ea
BUG: Fix issues with numeric_only deprecation (#47481)
rhshadrach Jun 23, 2022
70adab9
PERF improve performance of is_lexsorted (#47459)
MarcoGorelli Jun 24, 2022
27135a5
Improve error message for DataFrame.from_dict when wrong orient is pr…
KianShah Jun 24, 2022
e5c7543
BUG: isin casting to float64 for unsigned int and list (#46693)
phofl Jun 24, 2022
dc36ce1
ENH/TST: Add BaseInterfaceTests tests for ArrowExtensionArray PT2 (#4…
mroeschke Jun 24, 2022
b345d5f
DOC: Minor fixes in whatsnew 1.5.0 (#47498)
rhshadrach Jun 24, 2022
0ae6426
DOC: Resolve links in whatsnew (#47499)
rhshadrach Jun 24, 2022
c3b4af9
BUG: read_csv ignoring non existing header rows for python engine (#4…
phofl Jun 24, 2022
963111d
REF: remove dtstruct_to_dt64 (#47492)
jbrockmendel Jun 24, 2022
7683563
BUG: DataFrame.setitem raising when rhs is ea dtype Series (#47425)
phofl Jun 24, 2022
6786ab2
TYP: read_sas (#47410)
twoertwein Jun 24, 2022
d580826
DEPR: na_sentinel in factorize (#47157)
rhshadrach Jun 24, 2022
5e3d0ed
Fix SAS 0x40 control byte handling and column name parsing (#47113)
jonashaag Jun 24, 2022
64ad019
TYP: ndim is consistently a property (#47378)
twoertwein Jun 25, 2022
cf05e2f
ENH: Move PyperclipException and PyperclipWindowsException to error/_…
dataxerik Jun 25, 2022
109be48
TST,WARN: read_csv raises warning at wrong stacklevel, but _assert_ra…
MarcoGorelli Jun 26, 2022
7d2f9b8
TYP: some return annotations in pytables.py (#47512)
twoertwein Jun 26, 2022
6e0f8b8
Fix `vertical-align: middle;` for `Styler.to_excel` with xlsxwriter e…
tehunter Jun 27, 2022
428540b
DOC: Fix a typo in documentation for styler (#47517)
saiwing-yeung Jun 27, 2022
cb67837
TYP: _item_cache and _ixs (#47506)
MarcoGorelli Jun 27, 2022
4bb1fd5
TYP: Missing return annotations in util/tseries/plotting (#47510)
twoertwein Jun 27, 2022
f81ac72
Cache Conda env (#47454)
jonashaag Jun 27, 2022
90c7817
ENH: TDA.total_seconds support non-nano (#47421)
jbrockmendel Jun 27, 2022
734db4f
TYP: Return annotations for io/{formats,json} (#47516)
twoertwein Jun 27, 2022
1bc9197
BUG: read_xml iterparse doesn't handle multiple toplevel elements wi…
ParfaitG Jun 27, 2022
2bcbd25
Bug: GroupBy raising error with None in first level of MultiIndex (#4…
phofl Jun 27, 2022
2fc7fab
added link to advanced indexing guide in .loc docstring (#47520)
INDIG0N Jun 28, 2022
e48c9c3
TYP: more return annotations for io/* (#47524)
twoertwein Jun 28, 2022
7dad4e7
TYP: misc (#47509)
twoertwein Jun 28, 2022
612f566
ENH: TimedeltaArray add/sub with NaT preserve reso (#47522)
jbrockmendel Jun 28, 2022
cfc7234
WEB: Add Telegram to follow us and footer (#47531)
datapythonista Jun 28, 2022
fd1f643
WARN: Clarify datetime warning when can't parse in accordance with da…
MarcoGorelli Jun 28, 2022
80c005e
ENH: DatetimeIndex.indexer_between_time support non-nano (#47535)
jbrockmendel Jun 28, 2022
2b1184d
BUG: PeriodArray.__sub__(Period) overflows (#47538)
jbrockmendel Jun 28, 2022
2288135
ENH/TST: Add BaseParsinngTests tests for ArrowExtensionArray (#47536)
mroeschke Jun 29, 2022
ad842d3
PERF: Improve Styler `to_excel` Performance (#47371)
tehunter Jun 29, 2022
f4ca4d3
ENH: DatetimeIndex.snap support non-nano (#47537)
jbrockmendel Jun 29, 2022
417d1c9
ASV: Non-unique DataFrame index (#47551)
kevinanker Jun 30, 2022
32cb5aa
Stacklevel argument updated #46687 (#47035)
Anupam-USP Jun 30, 2022
4f566c8
DOC: update tutorials.rst to include website for searchable Pandas re…
TraverseTowner Jun 30, 2022
ed55bdf
Initial draft: from_dummies (#41902)
pckSF Jun 30, 2022
cd2b819
BUG: DataFrame.loc not aligning dict when setting to a column (#47361)
phofl Jun 30, 2022
9a3231f
DOC clarify inplace operation section in 1.5 whats_new (#47433)
lesteve Jun 30, 2022
6fd3376
REGR: concat materializing index even if sort is not necessary (#47508)
phofl Jun 30, 2022
8ce4f29
DOC: Convert URL example to static code block (#47540)
mroeschke Jun 30, 2022
231b9fa
REF: share some constructor code (#47555)
jbrockmendel Jul 1, 2022
ebc96ae
REF: Reduce duplicative methods between XML parser classes (#47553)
ParfaitG Jul 1, 2022
f6ba50a
TYP: fix type of level in DataFrame.groupby (#47560)
twoertwein Jul 1, 2022
3e9aaf4
[DOC] improving the ExcelWriter docstring (#47568)
noatamir Jul 1, 2022
9f38929
Use license_files instead of license_file (#47572)
phofl Jul 1, 2022
6792f5c
REF: simplify DTA.__init__ (#47574)
jbrockmendel Jul 1, 2022
5266143
CLN: Move monotonic check to safe_sort_index (#47573)
phofl Jul 1, 2022
55d9dcf
PERF: Some faster date/time string formatting (#46759)
smarie Jul 1, 2022
bd9a6f0
BUG: Series.setitem losing precision when enlarging (#47342)
phofl Jul 1, 2022
f8b3585
CLN: concat union_indexes now respecting sort keyword (#47588)
phofl Jul 3, 2022
4fddc41
BUG: DataFrame.loc not aligning rhs df for single block case (#47581)
phofl Jul 3, 2022
d05bdeb
Adding testcase for quartile performed on a timestamp column group by…
dannyi96 Jul 3, 2022
a7c8513
TYP: fix a few errors found by pandas-stub (#47562)
twoertwein Jul 3, 2022
f538568
TYP: misc return type annotations (#47558)
twoertwein Jul 3, 2022
b81f431
ENH/TST: Add BaseGroupbyTests tests for ArrowExtensionArray (#47515)
mroeschke Jul 3, 2022
1ac1391
BUG: concat losing columns dtypes for join=outer (#47586)
phofl Jul 3, 2022
1924be3
PERF: use c-division in nancorr (#47518)
MarcoGorelli Jul 3, 2022
2856607
BUG: to_sql with method=callable not returning int raising TypeError …
mroeschke Jul 3, 2022
4fe2f31
BUG: read_excel not forward filling index columns when no names are g…
phofl Jul 3, 2022
37d0194
DEPR: numeic_only in Series and SeriesGroupBy consistency (#47561)
rhshadrach Jul 5, 2022
5fb276a
TST: added test for to_json when called on numbers exceeding the int6…
johnmantios Jul 5, 2022
fa76d81
BUG Fix: pd.apply returns a dataframe when empty dataframe instead of…
Jul 5, 2022
319964a
TST: added test for handling repeated keys when using Series.loc with…
johnmantios Jul 5, 2022
e52f8b7
ENH: Move CSSWarning to error/__init__.py per GH27656 (#47591)
dataxerik Jul 5, 2022
8e8f627
Fix SAS7BDAT run-length encoding formula (#47099)
jonashaag Jul 5, 2022
b6d5e97
REF: share DTA/TDA.__init__ (#47583)
jbrockmendel Jul 5, 2022
4d17588
CI: flake8-pyi (#47595)
twoertwein Jul 5, 2022
700ef33
ENH/TST: Add BaseMethodsTests tests for ArrowExtensionArray (#47552)
mroeschke Jul 5, 2022
67e8c4c
ENH: DTI/DTA.astype support non-nano (#47579)
jbrockmendel Jul 5, 2022
37e6239
TST: Test aggregate with list values #25581 (#47559)
srotondo Jul 5, 2022
d9dd128
Revert inclusive default change of IntervalDtype (#47367)
phofl Jul 6, 2022
7a29d4a
DOC: fix shared compression_options and decompression_options (#47609)
alastair Jul 6, 2022
a9a496c
⬆️ UPGRADE: Autoupdate pre-commit config (#47564)
github-actions[bot] Jul 6, 2022
9612375
TYP: return values in core/*.py (#47587)
twoertwein Jul 6, 2022
9a8b193
DOC: Fixed pandas capitalization (#47617)
Marvjowa Jul 6, 2022
b4ec020
TST: Added test for strides in column major Dataframes stored in HDFS…
johnmantios Jul 7, 2022
efebf78
BUG: Return Float64 for read_parquet(use_nullable_dtypes=True) (#47619)
mroeschke Jul 7, 2022
028d0d6
WARN: Don't show FutureWarning when enlarging df with iloc (#47621)
phofl Jul 7, 2022
f654176
TYP: more return annotations in core/ (#47618)
twoertwein Jul 7, 2022
2141c1c
CLN: Add tets and fix docstring for concat with sort=True and outer j…
phofl Jul 7, 2022
d52f2ce
Test for nested series equality #22400 (#47627)
srotondo Jul 7, 2022
fce0953
ENH/TST: Add TestBaseReshaping tests for ArrowExtensionArray (#47601)
mroeschke Jul 8, 2022
3adfcff
BUG: using read_xml with iterparse and names will ignore duplicate va…
ParfaitG Jul 8, 2022
1ad2821
DOC: avoid overriding Python built-in functions (#47631)
wany-oh Jul 8, 2022
16e9b1e
TST: adding test for multiindex nunique raising not implemnted error …
noatamir Jul 8, 2022
f6658ef
DEPR: Remove deprecation from private class IntervalTree (#47637)
phofl Jul 8, 2022
64ae0fe
POC/ENH: Timedelta min/max/resolution support non-nano (#47641)
jbrockmendel Jul 8, 2022
cef9aad
Fixed mypy errors in frequencies.py (#47644)
srotondo Jul 8, 2022
bd4ff39
DEPR: Deprecate set_closed and add set_incluive (#47636)
phofl Jul 8, 2022
aa5b866
"DOC #45443 edited the documentation of where/mask functions" (#47626)
ahmedibrhm Jul 8, 2022
03b0b2c
CI: Bump Mac/Windows build timeout to 120 minutes (#47523)
mroeschke Jul 8, 2022
2367feb
BUG: boolean indexer with NA raising when reindex is necessary (#47623)
phofl Jul 8, 2022
8e6ca28
BUG: Fixed `tm.set_locale` context manager, it could error and leak w…
smarie Jul 8, 2022
e915b0a
ENH: RangeIndex._shallow_copy can return RangeIndex (#47557)
mroeschke Jul 8, 2022
5506476
TYP: Improve typing interval inclusive (#47646)
phofl Jul 9, 2022
f74a186
BUG: Add `storage_option` parameter to to_excel method in Styler (#46…
eshirvana Jul 9, 2022
a4fec22
TYP: enable reportUnusedImport (#46937)
twoertwein Jul 10, 2022
9b63034
BUG: Series.str.zfill() behaves differently than str.zfill() from sta…
xr-chen Jul 10, 2022
102b3ca
CLN: Rename private variables to inclusive (#47655)
phofl Jul 10, 2022
01d0874
TYP: fix some of the __hash__ methods (#47654)
twoertwein Jul 10, 2022
262a716
WEB: Update NumFOCUS committee members (#47650)
datapythonista Jul 10, 2022
03af0ac
REGR: maybe_convert_objects ignoring uints (#47475)
rhshadrach Jul 10, 2022
56dc719
SAS7BDAT parser: Faster string parsing (#47404)
jonashaag Jul 10, 2022
f7f3e6e
DOC: Fixed CoC broken link (#47661)
RadoslawLemiec Jul 10, 2022
63caef9
Enh move pytable errors and warnings (#47662)
dataxerik Jul 10, 2022
5ec6139
TYP: make _engine_type consistently a property (#47664)
twoertwein Jul 11, 2022
ae1d859
WEB: Add governance page to the website (#47660)
datapythonista Jul 11, 2022
25d1c17
ci: add GitHub token permissions for workflows (#47652)
varunsh-coder Jul 11, 2022
6867bab
Code review from #46759 : moved strftime benchmark file outside of ts…
smarie Jul 11, 2022
b03389e
BUG: df.groupby().resample()[[cols]] without key columns raise KeyErr…
GYHHAHA Jul 11, 2022
669f21f
ENH: TDA.__mul__ support non-nano (#47668)
jbrockmendel Jul 11, 2022
2eca7e1
ENH: TDA+datetime_scalar support non-nano (#47675)
jbrockmendel Jul 12, 2022
f23e441
WEB: Updating links of the governance page (#47679)
datapythonista Jul 12, 2022
de40565
TYP: make na_value consistently a property (#47676)
twoertwein Jul 12, 2022
de5c85b
CLN: non-nano follow-ups (#47682)
jbrockmendel Jul 12, 2022
74f4e81
DOC: Clarify that FrozenList is hashable (#47684)
pyrito Jul 12, 2022
c711be0
TST: avoid sort when concat int-index Dataframes with sort=False (#47…
GYHHAHA Jul 12, 2022
950408e
ENH: dt64/td64 comparison support non-nano (#47691)
jbrockmendel Jul 13, 2022
13e859f
CI: Fix npdev build post Cython annotation change (#47670)
mroeschke Jul 13, 2022
5690e8f
ENH: Move database error to error/__init__.py per GH27656 (#47674)
dataxerik Jul 14, 2022
a7b8c1d
TST: add test for groupby with dropna=False on multi-index (#47717)
GYHHAHA Jul 14, 2022
0a8b45f
BUG: df.fillna ignores axis when df is single block (#47714)
GYHHAHA Jul 14, 2022
ff74bb6
DEPR: deprecate unused errors in NDFrame.where/mask (#47728)
twoertwein Jul 15, 2022
87e9c4a
BUG: Make xticks from _quarterly_finder() line up better (#47602)
elidourado Jul 15, 2022
030f870
BUG: groupby.corrwith fails with axis=1 and other=df (#47731)
GYHHAHA Jul 15, 2022
53af38e
GH: Add CITATION.cff (#47710)
mroeschke Jul 16, 2022
1f20e93
GH: Convert feature request template to GH form (#47696)
mroeschke Jul 16, 2022
4f54bf6
DOC: Centeralized testing guidance for contributions (#47692)
mroeschke Jul 16, 2022
0b8d8bb
ENH/TST: Add TestBaseArithmeticOps tests for ArrowExtensionArray #47…
mroeschke Jul 16, 2022
cb5a924
ENH: add result_names argument to DataFrame.compare #44354 (#47643)
ahmedibrhm Jul 16, 2022
e3698dd
TST: Test for the Enum triggering TypeError (#22551 issue) (#47715)
Shadimrad Jul 16, 2022
ad7dcef
BUG: numeric_only with axis=1 in DataFrame.corrwith and DataFrameGrou…
rhshadrach Jul 16, 2022
0a6be38
BUG: Correct numeric_only default for resample var and std (#47749)
rhshadrach Jul 16, 2022
5eca390
DOC: fix typos in "See also" documentation section (#47712)
partev Jul 16, 2022
fc68a9a
BUG: Fix pc.power_checked min version (#47752)
mroeschke Jul 17, 2022
6251e0b
TYP: reflect ensure_* function removals (#47758)
hauntsaninja Jul 17, 2022
bdd9314
TYP: Update timestamps.pyi (#47756)
hauntsaninja Jul 17, 2022
08d38a3
add ignore for new mypy error 'type-var' (#47766)
anilbey Jul 17, 2022
56bfb41
TST: misplaced string array test (#47765)
jorisvandenbossche Jul 18, 2022
833ef31
Specify that both ``by`` and ``level`` should not be specified in ``g…
GivyBoy Jul 18, 2022
2d6e0b2
Cln tests interval wrt inclusive (#47775)
phofl Jul 18, 2022
2ff1d0a
REF: de-duplicate get_conversion_factor (#47770)
jbrockmendel Jul 18, 2022
e9655f5
STYLE add future annotations where possible (#47769)
MarcoGorelli Jul 18, 2022
0a26cdd
TYP: Appender also works with properties (#47768)
twoertwein Jul 18, 2022
b731518
PERF: operations with zoneinfo tzinfos (#47767)
jbrockmendel Jul 18, 2022
87930ef
TYP: a few mismatches found by stubtest (#47764)
twoertwein Jul 18, 2022
f9346a6
BUG: Set y-axis label, limits and ticks for a secondary y-axis (#4775…
StefRe Jul 18, 2022
089f7f8
TYP: def validate_* (#47750)
twoertwein Jul 18, 2022
128832f
TST: add test for last() on dataframe grouped by on boolean column (#…
kapiliyer Jul 18, 2022
250d971
DOC: update min package versions in install.rst to align with v.1.5.0…
JMBurley Jul 18, 2022
cf4758f
opt out of bottleneck for nanmean (#47716)
sebasv Jul 18, 2022
efd15b7
TYP: freq and na_value (#47729)
twoertwein Jul 18, 2022
1b1dd36
BUG: fix regression in Series[string] setitem setting a scalar with a…
jorisvandenbossche Jul 18, 2022
a7c5773
FIX: PeriodIndex json roundtrip (#47747)
jmg-duarte Jul 18, 2022
3a39d25
TST: Add additional test for future warning when call Series.str.cat(…
xr-chen Jul 19, 2022
9bf1e96
BUG: wide_to_long fails when stubname misses and i contains string ty…
GYHHAHA Jul 19, 2022
9f5c8b9
BUG: PeriodIndex + TimedeltaArray-with-NaT (#47783)
jbrockmendel Jul 20, 2022
f7e0e68
DOC: Fix versionadded for callable in on_bad_lines (#47792)
phofl Jul 20, 2022
187636f
WEB: Update to `jupyterlite==0.1.0b10` (#47532)
jtpio Jul 21, 2022
caf261f
REF: re-use convert_reso (#47807)
jbrockmendel Jul 21, 2022
060ce49
TYP: Column.null_count is a Python int (#47804)
mroeschke Jul 21, 2022
0ce2921
Update join docs for other param (#46850)
multimeric Jul 21, 2022
bd31d64
BUG: Series map ignoring na_action for dict or series mapper (#47585)
phofl Jul 21, 2022
96b036c
ENH: Timestamp.min/max/resolution support non-nano (#47720)
jbrockmendel Jul 21, 2022
433dcd5
ENH/TST: Add BaseUnaryOpsTests tests for ArrowExtensionArray (#47711)
mroeschke Jul 21, 2022
8f04a8e
ENH/TST: Add isin, _hasna for ArrowExtensionArray (#47805)
mroeschke Jul 22, 2022
bedd8f0
ENH/TST: Add Reduction tests for ArrowExtensionArray (#47730)
mroeschke Jul 22, 2022
3d94f7a
BUG: fix SparseArray.unique IndexError and _first_fill_value_loc algo…
GYHHAHA Jul 22, 2022
d8bb752
DOC: Updating some capitalization in doc/source/user_guide #32550 (#4…
SomtochiUmeh Jul 22, 2022
8c7b0b2
TST/CI: xfail test_round_sanity for 32 bit (#47803)
mroeschke Jul 22, 2022
a62897a
WEB: Update sponsors in website (#47678)
datapythonista Jul 24, 2022
7cd9b82
Speed up SAS7BDAT parser
jonashaag May 23, 2022
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
The table of contents is too big for display.
Diff view
Diff view
  •  
  •  
  •  
62 changes: 0 additions & 62 deletions .github/CODE_OF_CONDUCT.md

This file was deleted.

3 changes: 0 additions & 3 deletions .github/CONTRIBUTING.md

This file was deleted.

3 changes: 0 additions & 3 deletions .github/FUNDING.yml

This file was deleted.

33 changes: 0 additions & 33 deletions .github/ISSUE_TEMPLATE/feature_request.md

This file was deleted.

72 changes: 72 additions & 0 deletions .github/ISSUE_TEMPLATE/feature_request.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,72 @@
name: Feature Request
description: Suggest an idea for pandas
title: "ENH: "
labels: [Enhancement, Needs Triage]

body:
- type: checkboxes
id: checks
attributes:
label: Feature Type
description: Please check what type of feature request you would like to propose.
options:
- label: >
Adding new functionality to pandas
- label: >
Changing existing functionality in pandas
- label: >
Removing existing functionality in pandas
- type: textarea
id: description
attributes:
label: Problem Description
description: >
Please describe what problem the feature would solve, e.g. "I wish I could use pandas to ..."
placeholder: >
I wish I could use pandas to return a Series from a DataFrame when possible.
validations:
required: true
- type: textarea
id: feature
attributes:
label: Feature Description
description: >
Please describe how the new feature would be implemented, using psudocode if relevant.
placeholder: >
Add a new parameter to DataFrame, to_series, to return a Series if possible.

def __init__(self, ..., to_series: bool=False):
"""
Parameters
----------
...

to_series : bool, default False
Return a Series if possible
"""
if to_series:
return Series(data)
validations:
required: true
- type: textarea
id: alternative
attributes:
label: Alternative Solutions
description: >
Please describe any alternative solution (existing functionality, 3rd party package, etc.)
that would satisfy the feature request.
placeholder: >
Write a custom function to return a Series when possible.

def to_series(...)
result = pd.DataFrame(...)
...
validations:
required: true
- type: textarea
id: context
attributes:
label: Additional Context
description: >
Please provide any relevant Github issues, code examples or references that help describe and support
the feature request.
1 change: 0 additions & 1 deletion .github/SECURITY.md

This file was deleted.

8 changes: 5 additions & 3 deletions .github/actions/build_pandas/action.yml
Original file line number Diff line number Diff line change
Expand Up @@ -6,8 +6,8 @@ runs:

- name: Environment Detail
run: |
conda info
conda list
micromamba info
micromamba list
shell: bash -el {0}

- name: Build Pandas
Expand All @@ -17,4 +17,6 @@ runs:
shell: bash -el {0}
env:
# Cannot use parallel compilation on Windows, see https://github.com/pandas-dev/pandas/issues/30873
N_JOBS: ${{ runner.os == 'Windows' && 1 || 2 }}
# GH 47305: Parallel build causes flaky ImportError: /home/runner/work/pandas/pandas/pandas/_libs/tslibs/timestamps.cpython-38-x86_64-linux-gnu.so: undefined symbol: pandas_datetime_to_datetimestruct
N_JOBS: 1
#N_JOBS: ${{ runner.os == 'Windows' && 1 || 2 }}
27 changes: 27 additions & 0 deletions .github/actions/run-tests/action.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,27 @@
name: Run tests and report results
runs:
using: composite
steps:
- name: Test
run: ci/run_tests.sh
shell: bash -el {0}

- name: Publish test results
uses: actions/upload-artifact@v2
with:
name: Test results
path: test-data.xml
if: failure()

- name: Report Coverage
run: coverage report -m
shell: bash -el {0}
if: failure()

- name: Upload coverage to Codecov
uses: codecov/codecov-action@v2
with:
flags: unittests
name: codecov-pandas
fail_ci_if_error: false
if: failure()
17 changes: 8 additions & 9 deletions .github/actions/setup-conda/action.yml
Original file line number Diff line number Diff line change
Expand Up @@ -6,8 +6,8 @@ inputs:
environment-name:
description: Name to use for the Conda environment
default: test
python-version:
description: Python version to install
extra-specs:
description: Extra packages to install
required: false
pyarrow-version:
description: If set, overrides the PyArrow version in the Conda environment to the given string.
Expand All @@ -24,14 +24,13 @@ runs:
if: ${{ inputs.pyarrow-version }}

- name: Install ${{ inputs.environment-file }}
uses: conda-incubator/[email protected]
uses: mamba-org/provision-with-micromamba@v12
with:
environment-file: ${{ inputs.environment-file }}
activate-environment: ${{ inputs.environment-name }}
python-version: ${{ inputs.python-version }}
channel-priority: ${{ runner.os == 'macOS' && 'flexible' || 'strict' }}
environment-name: ${{ inputs.environment-name }}
extra-specs: ${{ inputs.extra-specs }}
channels: conda-forge
mamba-version: "0.24"
use-mamba: true
use-only-tar-bz2: true
channel-priority: ${{ runner.os == 'macOS' && 'flexible' || 'strict' }}
condarc-file: ci/condarc.yml
cache-env: true
cache-downloads: true
3 changes: 3 additions & 0 deletions .github/workflows/32-bit-linux.yml
Original file line number Diff line number Diff line change
Expand Up @@ -12,6 +12,9 @@ on:
paths-ignore:
- "doc/**"

permissions:
contents: read

jobs:
pytest:
runs-on: ubuntu-latest
Expand Down
6 changes: 6 additions & 0 deletions .github/workflows/assign.yml
Original file line number Diff line number Diff line change
Expand Up @@ -3,8 +3,14 @@ on:
issue_comment:
types: created

permissions:
contents: read

jobs:
issue_assign:
permissions:
issues: write
pull-requests: write
runs-on: ubuntu-latest
steps:
- if: github.event.comment.body == 'take'
Expand Down
13 changes: 7 additions & 6 deletions .github/workflows/asv-bot.yml
Original file line number Diff line number Diff line change
Expand Up @@ -9,8 +9,15 @@ env:
ENV_FILE: environment.yml
COMMENT: ${{github.event.comment.body}}

permissions:
contents: read

jobs:
autotune:
permissions:
contents: read
issues: write
pull-requests: write
name: "Run benchmarks"
# TODO: Support more benchmarking options later, against different branches, against self, etc
if: startsWith(github.event.comment.body, '@github-actions benchmark')
Expand All @@ -33,12 +40,6 @@ jobs:
with:
fetch-depth: 0

- name: Cache conda
uses: actions/cache@v3
with:
path: ~/conda_pkgs_dir
key: ${{ runner.os }}-conda-${{ hashFiles('${{ env.ENV_FILE }}') }}

# Although asv sets up its own env, deps are still needed
# during discovery process
- name: Set up Conda
Expand Down
6 changes: 6 additions & 0 deletions .github/workflows/autoupdate-pre-commit-config.yml
Original file line number Diff line number Diff line change
Expand Up @@ -5,8 +5,14 @@ on:
- cron: "0 7 1 * *" # At 07:00 on 1st of every month.
workflow_dispatch:

permissions:
contents: read

jobs:
update-pre-commit:
permissions:
contents: write # for technote-space/create-pr-action to push code
pull-requests: write # for technote-space/create-pr-action to create a PR
if: github.repository_owner == 'pandas-dev'
name: Autoupdate pre-commit config
runs-on: ubuntu-latest
Expand Down
Loading