Programming > XML
extract data from large xml files using C#
iPhone file syncing app
web parser (scheduled): NING/Joomla-based to MySQL
Hello friends! ---------------------------------------------------------------------- SOURCE WEBWe need to crawling community based on NING platform: - extract trainers (with personal data) http://www. worldonenesscommunity. com/page/find-oneness-trainers< / a> (NING-based) andhttp://www. onenessuniversity. org/index. php/find-trainer-events/find-oneness-trainers< / a> (Joomla-based) - extract givers (with personal data) http://www. worldonenesscommunity. com/page/find-deeksha-givers< / a> (NING-based) - extract events (all location) http://www. worldonenesscommunity. com/page/find-events-by-location< / a> (NING-based) and http://www. onenessuniversity. org/index. php/find-trainer-events/find-oneness-events< / a> (Joomla-based) - extract links to featuredVIDEOShttp://www. worldonenesscommunity. com/video/video/listFeatured< / a> (NING-based) PHOTOShttp://www. worldonenesscommunity. com/photo/photo/listFeatured< / a> (NING-based) NOTEShttp://www. worldonenesscommunity. com/notes/index/allNotes< / a> (NING-based) - extract user data from profile- extract group description- extract posts, eventsone of our webs based on JomSocial (joomla community extension). Please see example for understanding work at http://www. jomsocial. com/community. html< / a>-------------------------------------------------------------------- REQUIREMENTS TO CODE1. An easy job for anyone rails/PHP competent. You may use every tool where You are master. We not set restriction: PHP, Rail, and so on... We only need that final code are: FASTWELL DOCUMENTED2. Results must exported into server-side MySQL (or XML document) on our linux-based server. 3. Code need to be created by " sub-blocks/functions " for example for different purposes: - refresh trainers list on worldonenesscommunity. com once a 40 min- refresh events on onenessuniversity. org list once a 30 min- refresh featured videos- and so on. ---------------------------------------------------------------------- REQUIREMENTS TO APLYERYou must have previous experience with website crawling to apply for job. PLEASE INCLUDE # OF YEARS EXPERIENCE YOU HAVE. THIS IS AN ENTRY LEVEL JOB WITH THE POSSIBILITY OF LONG TERM RELATIONSHIP. P. S. I must say that Ning nas no API for this goals at all: http://developer. ning. com/docs/ningapi/1.0/overview/introduction. html#what-you-cannot-do< / a> and roadmap are not for optimizmhttp://developer. ning. com/docs/ningapi/1.0/changelog/2011-11-18. html< / a>
Part Time Developer
Visualizer in Java and XML
Need ANDROID QU1Z game app
SEO Developer
Ipad Application
We need an Ipad application that works like thishttp://flippingbook. com/demos/publisher / < / a>functions required: - both vertical (1 page) and horizontal (2 pages) mode- same page transictions as in the example given- search- go to page.. - next - previousThe pdf included will be taken from the internet and saved in the application. When there will be any update, it will download the latest version (and store it, so the application/pdf can be read offline) If you want to apply for this job, please --> provide a fixed price <-- and time to develop, and a couple of examples of your previous works. The application should look as similar as possible to the example (http://flippingbook. com/demos/publisher /), same effects and usability. The server part will be given. It will work like this: At the first launch the app will call a page on a server and will receive an xml response with the latest update time and link to the pdf. Then the user will be asked if he wants to upload to the newer version. Thank you
Backend Tool To Scrape Data & Grab API Data & Store in SQL
======================================I have a large scale project in mind and I 'd like YOU to help me with part 1. All going well, this could evolve into a lot more work for you. ======================================We are going to start off with a tool that scrapes data from Google's keyword tool (https://adwords. google. com/select/KeywordToolExternal). We will store that in a database table. Next, we will use Namecheap's API to check for domain availability, marking the results back in the database. Pretty simple. To make it a bit more smart, scraping will made through private proxies to spread the load. We will also de-duplicate repeat keywords. That's the summary; full details are provided in the details section. Remember, this is just part 1 to get the ball rolling. The tool gets seriously cool later on:) Part 1 really doesn't have a UI. But later on it will have some basic features. However, design isn't important since the tool is not public-facing. Serious bonus points if you were able to make use of something like Twitter's Bootstrap library (http://twitter. github. com/bootstrap /) and Glyphicon's icons (http://glyphicons. com /) to make it user friendly and slick. I 'd welcome use of PHP and MySQL for this task as I am able to code a little and would then be able to make tiny tweaks myself. If you have a strong preference for another language, then just let me know. You will notice that I have a glowing profile on oDesk, so rest assured that I am great to work with:) All applicants must understand the task details before submitting their bid. Please also include the word " bananas " in your bid so that I know you have read this! Template answers will be ignored! ======================================Full Specification: ======================================The tool starts off with a large empty textarea box. This is where users will enter in a line separated list of keywords (they 'll typically put in around 50 each time). A submit button then adds those keywords into a 'queue ' (ie, into the database with a status of 0 = newly added, we should also store the timestamp of entry, and give each keyword an incremental ID number). The form then refreshes blank and that's all the user sees. Next, the tool needs to fetch data from Google's keyword tool. https://adwords. google. com/select/KeywordToolExternal< / a> is the URL. They use CAPTCHAs if you are not logged in, so we need to have a dummy Google account that is logged in with a cookie to be able to fetch the data. The keywords will go into the " word or phrase " textarea. We also need to select only the " [Exact] " " match type " option. Location should be only " United States " and the rest as per default. We are going to get back a bunch of results broken down into two sections; " search terms " and " keyword ideas ". We are interested in booth. We need to collect and store: " global searches ", " local searches ", and " CPC ". We will also have a database field to store keyword type; 1 for " search terms ", 0 for " keyword ides ". Google provides the resulting data in CSV/XML which might be easier to parse into the database than scraping the HTML table. We are then going to do a domain availability lookup for keywords where local search volume is >1000 and CPC is >$1. We will use Namecheap's free API: http://developer. namecheap. com/docs/doku. php? id=api-reference: domains: check< / a> (I can provide you with the API access code/key, or you can create your own free account for testing). We need to check for availability for. com /. net /. org domain names for the keyword phrase, removing word spacing. So if the keyword is " free ipod nano " then the domain we want to check for is " freeipodnano. com ", " freeipodnano. net " and " freeipodnano. org ". We will have three fields in the database and use 0/1 for availability. That's the process. But we need to add in an additional process that limits the amount of scraping we are doing from our server's IP. I will provide a list of around 50 private proxies (http://proxybonanza. com /). We should make use of them all to spread the load. I suggest only submitting 5 keywords per IP each 15-minutes. Maybe be need to use something like a CRON job to process another batch of KWs every 15-minutes? It would be really great if we could even find a way for those 15-minutes to be less predictable (ie. generating a random 0-5 minute time to add on to the 15, or perhaps sleep/rand would be useful?). The scraper should do what it can to appear as close to a real user as possible (so using a browser user-agent, for example). We don't need to worry about this kind of balancing when performing availability checks with Namecheap. Lastly, we can also filter duplicates. So when the user submits their list of keywords, we can ignore those already in the database. Also, when we fetch for suggestions, we can ignore those that are already present.
Web Developer (PHP, MySQL, CSS, HTML, Javascript)
Aircraft Management Website Development
Joomla developer needed to create Aircraft Management website.
MS SQL 8 Developer
.Net Framework C# Developer
To View More Detail on This Position and Apply, You Must Click on the Link Below To Access our Company Careers and Recruiting Website: http://tbe. taleo. net/NA5/ats/careers/requisition. jsp? org=DEVFACTORYandcws=1andrid=92< / a>The DevFactory. Net Framework C# Developer role pays $13.5/hr. DevFactory is looking for several. Net Framework developers to create and augment several utilities to automate our quality engineering processes. The development will be in C#. Initial assignments will likely involve parsing XML files into. Net framework objects and TestComplete integration, but will branch into additional automation projects as well. DevFactory is looking to hire several python programmers to implement, maintain, and test these python utilities. Requirements:. Net FrameworkC# programming languageFundamental XML knowledge experienceTestComplete/TestExecute desired (but not required) Microsoft certified professional on. Net Framework desired (but not required) In addition, this role has the following non-technical requirements: English - all candidates must be able to speak and write capably in English. English need not be the native first language, but it should be sufficient to enable technical discussion. Video - candidates must have the computer hardware and networking bandwidth to conduct a seamless video skype conversation for team communication. Ongoing use of webcam for odesk billing and skype video are required. Full Time - This job is only offered for a Full Time basis (40 hrs/wk). Qualification for this role begins with an unpaid testing phase where candidates are required to provide verifiable identification, complete online skills testing, and attend a brief Skype video interview. The testing phase lasts for no more than a week, and highly motivated candidates can complete all requirements of the testing phase within a few hours. Candidates who pass our testing phase will be offered a one week fixed price trial assignment. Candidates who successfully complete our trial assignment within the given time frame will be offered the full time position at 40 hrs/wk at the jobs hourly rate. Those who show progress and dedication during the trial phase but fail to complete the assignment successfully may be offered an alternate position within DevFactory. DevFactory will make you $1 fixed price offer to begin the testing phase. Detailed instructions for completing the testing phase will be sent promptly upon your acceptance of the testing phase offer. Please note that the offer is not monetarily contractual, but simply allows us to bring you into the appropriate recruiting team room.
RSS XML scraping
Here's a list of 800 podcasts: http://www. npr. org/rss/podcast/podcast_directory. php? type=titleandvalue=all< / a>. I need the following items extracted from this page: 1) Title of each feed2) Provider3) RSS feed URL of each feed. If you click into the link on each Title, there should be a link to the XML file of the RSS feed (see the orange POD icon). I need this URL4) most recent PubDate. If you view the source of the RSS feed XML, you will see a


