With pandas 2.0, custom time handling might not be necessary any more #242

@mikapfl

Describe the bug

Most of scmdata's time-handling functionality duplicates functionality that already exists in pandas. Because pandas is a much more mature project, the scmdata versions are also far less capable than their pandas counterparts.

Resampling is one example. The pandas code for resampling is very concise, and both interpolation and extrapolation are available. The equivalent of pandas'

ts.resample("5Min").sum()

in scmdata is much more involved, simply because run.interpolate takes target_times as an explicit list of time values.
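To make the pandas side concrete, here is a minimal sketch using plain pandas only. The series `ts` and `annual` are made-up illustration data, not anything from scmdata, and I use the lowercase alias "5min", which should behave like "5Min" above on current pandas versions.

```python
import pandas as pd

# Made-up minute-resolution data: values 0..9 at one-minute spacing.
ts = pd.Series(
    range(10),
    index=pd.date_range("2020-01-01", periods=10, freq="min"),
)

# Downsampling: sum the values falling into each 5-minute bin.
five_min = ts.resample("5min").sum()

# Made-up annual data with two points one year apart.
annual = pd.Series(
    [0.0, 12.0],
    index=pd.to_datetime(["2020-01-01", "2021-01-01"]),
)

# Upsampling: create month-start time points and fill them by
# linear interpolation between the existing values.
monthly = annual.resample("MS").interpolate("linear")

print(five_min)
print(monthly)
```

In both cases the target time axis is described by a frequency string, so nobody has to build the list of target times by hand.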

As a result, I found it much easier to convert ScmRun objects into pandas DataFrames with a proper pd.DatetimeIndex when working on even the simplest time-related problems. The question is then: why is there a custom re-implementation of pandas' functionality in scmdata at all? Wouldn't it be better to get rid of it and leverage pandas' functionality?
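A rough sketch of the kind of conversion I mean, using plain pandas only. The `wide` table here is hypothetical stand-in data (one row per timeseries, one column per time point, with the time points stored as strings). It is not produced by any scmdata call, and the row labels are invented.

```python
import pandas as pd

# Hypothetical wide table: 24 monthly time points as string column labels.
months = pd.date_range("2000-01-01", periods=24, freq="MS")
wide = pd.DataFrame(
    [[float(i) for i in range(24)], [float(2 * i) for i in range(24)]],
    index=pd.Index(["series_a", "series_b"], name="variable"),
    columns=[m.strftime("%Y-%m-%d") for m in months],
)

# Put time on the row axis and turn the labels into a DatetimeIndex.
tidy = wide.T
tidy.index = pd.to_datetime(tidy.index)
tidy.index.name = "time"

# With a DatetimeIndex in place, resampling is a one-liner:
# here, the annual mean of the monthly values.
annual_mean = tidy.resample("YS").mean()

print(annual_mean)
```

Once the index is a DatetimeIndex, all of pandas' time-related tooling (resampling, slicing by date strings, rolling windows) works without any extra glue.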
