python, week 5

i used code from here to put all the words from the first executive order into a set, which is a way to get all the unique words in the document.

## from allison parrish's http://www.decontextualize.com/teaching/rwet/simple-models-of-text/ import sys words = set() for linein sys.stdin: line = line.strip() line_words = line.split() for wordin line_words: words.add(word) for wordin words: print word

which produced “words” separated by line breaks. here are some interesting sections:

all

United

burden

PENDING

out

purchasers

Patient

for

enforceable

availability

HOUSE,

health

imperative

Nothing

benefit

Human

repeal,

otherwise

individuals,

control

Constitution

unwarranted

fiscal

head

with

legislative

Procedure

CARE

commerce

agency

Act

authorities

such

WHITE

law

affect:

impair

does

the

insurers,

okay so obviously some of these would be neat poems, so i tried to join them:

import sys for linein sys.stdin: line = line.strip() output = " ".join(line) print output
python, week 5

hmmmm noooo…

hmmmm nooooooo… okay, new activity: replacing the executive order with these poems.

i’m doing this manually for now since it would involve a bunch of regex, but i’ll record the steps here:

replace all instances of “Minimizing the Economic Burden of the Patient Protection and Affordable Care Act Pending Repeal” with first poem above, “all United burden,” in the style in which the original text appears (so, with .title() or .upper()) when sections begin, keep the text naming the section as such (“Section 1”, “Sec. 2”, etc.) but replace body of the section with the next poem above. remove newlines from poems above so the words flow like sentences, but don’t change case, punctuation, etc. fill sections for as many poems as were originally picked out from the set. delete sections that don’t have an accompanying poem.

this feels very related to a project i did in jer’s class last year where i replaced “mortgage” language with “data” language in hank paulson’s 2008 announcement about the economy. python woulda helped with that/made it better. anyway, executive order results here , original here .

another thing i was working on was figure out how to clean up the file without going through manually. these are things i did in the interpreter. i wonder if there’s a way to say if 'space' char appears > or = 2 times, replace it with ' ‘? it’d also be cool to figure out how to split on html tags so i don’t have to manually delete those. maybe this will be useful later.

for linein lines: line = line.strip().replace('', ' ') line = line.strip().replace(' ', ' ') line = line.strip().replace('', ' ') line = line.strip().replace(' ', ' ') line = line.strip().replace('', ' ') line = line.strip().replace(' ', ' ') line = line.strip().replace('', ' ') line = line.strip().replace(' ', ' ') print line

python, week 5

Trending Articles

[奇怪机翻组] 双梦相牵 / ふたりの夢もち [RJ01259078] [WebRip] [1080P HEVC-10Bit AAC 2.0]...

HONDA CITY VTI-S 菜單分享

#新闻拍一拍# 新的摩尔定律：黄氏定律

一如既往的痴情能否打动月瓶金蝎？ (豆瓣月亮水瓶小组)

求購按摩椅~'~

「粉红」不是霸凌辜莞允杠部落客：我爽在哪？

Intel 7-10代集成显卡驱动31.0.101.2137完整版

涉Gotbit加密货币市场操纵台男纽约被捕

臺灣法治會計學會2025年第三季研討會

不靠姊姊！張柏芝弟弟開計程車維生

关门一家亲：习远平、张澜澜、徐才厚

剑指offer——24.二叉树中和为某一值的路径

苏珊米勒日晕05.11｜狮子鼓励孩子；处女相信自己 (豆瓣 SUSAN MILLER小组)

【台積電IT卓越新戰略5】台積IT組織5年三次大調整，要靠平臺工程讓DevOps創新再加速

【日语无字】春之钟.Haru.no.kane.1985.JAP.vhsrip.NoSub.by.xiongzaixia&vivi

美籍老公不讓步李愛綺兒子念公立小學

爆杨兰兰对于朦胧一见倾心泄露亲爹习近平致命机密？【阿波罗网报道】

湖州师范学院音乐学院开发的 Kontakt 8 明代魏氏乐琵琶/瑟/月琴音源即将发布

LameXP 4.21.2382 免安裝中文版 - MP3音樂轉檔軟體

免费翻墙节点大全