jjzjj

subreddit

全部标签

python - Pandas ,对于一列中的每个唯一值,在另一列中获取唯一值

我有一个数据框,其中每一行都包含与单个Reddit评论(例如作者、subreddit、评论文本)相关的各种元数据。我想做以下事情:对于每个作者,我想获取他们在其中发表评论的所有subreddits的列表,并将此数据转换为pandas数据框,其中每一行对应一个作者,以及所有的列表他们发表评论的独特子版block。我目前正在尝试以下的一些组合,但无法理解:尝试1:group=df['subreddit'].groupby(df['author']).unique()list(group)尝试2:fromcollectionsimportdefaultdictsubreddit_dict=d

mongodb - 如何在 MongoDB 中进行 HAVING COUNT?

我的文档如下所示:{"_id":ObjectId("5698fcb5585b2de0120eba31"),"id":"26125242313","parent_id":"26125241841","link_id":"10024080","name":"26125242313","author":"gigaquack","body":"blogging=creativewriting","subreddit_id":"6","subreddit":"reddit.com","score":"27","created_utc":"2007-10-2218:39:31"}我要做的是创建一个

mongodb - 如何在 MongoDB 中进行 HAVING COUNT?

我的文档如下所示:{"_id":ObjectId("5698fcb5585b2de0120eba31"),"id":"26125242313","parent_id":"26125241841","link_id":"10024080","name":"26125242313","author":"gigaquack","body":"blogging=creativewriting","subreddit_id":"6","subreddit":"reddit.com","score":"27","created_utc":"2007-10-2218:39:31"}我要做的是创建一个